Epitope tag and method for detection and/or purification of tagged polypeptides

ABSTRACT

Provided herein is a novel epitope that can be used as a tag in methods for rapid and effective characterization, purification, and subcellular localization of polypeptides of interest, which comprise the tag. The tag is specifically recognized by an epitope specific antibody, which can be used to detect, capture, quantify, and/or purify polypeptides of interest that are tagged with the epitope. Also provided is novel epitope specific antibody.

This application is a continuation application of U.S. patentapplication Ser. No. 15/352,913, filed Nov. 16, 2016, now issued as U.S.Pat. No. 10,273,265, which is a continuation application of U.S. patentapplication Ser. No. 14/996,903, filed on Jan. 15, 2016, now issued asU.S. Pat. No. 10,215,166 issued on Nov. 13, 2018, and claims priority toEuropean Patent Application No. 15194838.7, filed on Nov. 16, 2015, allof which are incorporated herein by reference in their entireties.

FIELD OF THE INVENTION

Provided herein are compositions and methods related to the field ofbiomedical research, biochemistry and cell biology. A novel epitope isprovided that can be used as a tag for use in rapid and effectivecharacterization, purification, and in vivo localization of polypeptidesof interest that compose the tag. The tag is specifically recognized bya novel epitope specific antibody, which allows antibody-taginteraction.

BACKGROUND

In the post-genomic era the field of proteomics has grown dramatically.For a multitude of applications, ranging from mass spectrometry analysisto high-content imaging, affinity-based assays are indispensable.Affinity-based assays rely on the detection ofprotein-protein-interaction (PPI) between a receptor and a ligand, suchas an antibody and an antigen. In the case of the monitoring of anantigen-antibody-interaction the assay is also called an immunoaffinityassay. Immunoaffinity assays are the method of choice for testing theidentity, quantity, and/or location of a polypeptide of interest.However, there are cases in which no suitable antibody is available.This disadvantage of immunoaffinity-based assays can be overcome by amethod called “epitope tagging”, wherein a protein is tagged with anepitope, i.e. the binding part of an antigenic protein.

Epitope tagging is a technique in which a known epitope is fused to arecombinant protein by means of genetic engineering. By choosing anepitope for which an antibody is available, live technique allows thedetection of proteins for which no antibody is available. Since the late1980s epitope tagging has become a standard molecular genetics methodfor enabling rapid and effective characterization, purification, and invivo localization of the protein products of cloned genes.

In the early days of proteomics the first commercially available tagswere originally designed for protein purification. Examples of theseearly tags are FLAG. 6×HIS and the glutathione-S-transferase (GST)system. The 6×HIS tag relies on metal affinity and the GST system relieson affinity of GST to glutathione. FLAG is one of the first epitope tagsused commercially.

Later on, the discovery of fluorescent protein reporters such as greatfluorescent protein (GFP) made it possible to detect proteinsintracellularly without the need of a secondary reagent. In this case,proteins of interest were tagged with the full-length protein sequenceof GFP, rendering the tagged protein of interest fluorescent.

However, the problem with tags comprising full-length proteins such asGST maltose-binding protein (MBP) or GFP, is that they sometimesstoically interfere with subcellular protein localization or folding,which may compromise or abrogate the native function of the protein tobe analyzed.

Therefore numerous small peptide-based epitope-tags such as c-myc, V5,HA, CBP or FLAG hate been developed. Such tags have either a syntheticorigin (FLAG) or are derived from viral (HA, V5) or endogenous mammalian(c-myc, CBP) proteins. They are characterized by a size of 8-26 aminoacid residues and are detected by classical IgGs (poly- or monoclonal).One problem with tags derived from endogenous proteins is the fact thatthe tag-specific antibody generally also binds to the endogenousprotein. If the interaction between the tag and the antibody is notspecific, the assay may give false positives. Due to the competitionbetween tagged protein and endogenous protein as binding partners forthe antibody, the assay will be less efficient.

Although many immunoaffinity capture systems are available usingtag-specific antibodies, there are still severe problems due to lowaffinity binding, unspecific interactions, batch to batch variations orreduced functionality of the antibodies upon covalent coupling to solidsurfaces. Furthermore, a specific problem of known immunoaffinitydetection and/or capture systems is their dependence on conventionalantibodies evolved by the vertebrate immune system to detect theepitope-tagged protein.

SUMMARY

Provided herein is an immunoaffinity detection and/or capture system,which is characterized by high affinity, specificity andreproducibility.

Also provided is an epitope specific antibody, which can specifically,reliably, and reproducibly interact with an epitope tag. Nucleic acidsencoding the antibody or fragments thereof are also provided. Forexample, provided herein is a nucleic acid encoding SEQ ID NO: 7, SEQ IDNO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12 and orSEQ ID NO: 13. Further provided is an epitope tag which can interactspecifically, reliably, reproducibly, and with high affinity with theepitope specific antibody. For example, the epitope tag can be anisolated epitope peptide consisting of or comprising from about 8 toabout 25 amino acids, wherein the amino acid sequence consists of orcomprises a sequence as defined in SEQ ID NO: 1 (RX₄X₅AX₇SX₉W), whereinX₄ can be K or a substitution, wherein X₅ can be A or R or aconservative substitution of A or R; wherein X₇ can be V or aconservative substitution of V, and/or wherein X₉ can be H or aconservative substitution of H. Nucleic acids encoding the isolatedepitope peptides are also provided.

Also provided is an isolated epitope peptide consisting of or comprisingfrom 12 to 25 amino acids, wherein the amino acid sequence consists ofor comprises a sequence as defined by SEQ ID NO:32(X₁X₂RX₄X₅AX₇SX₉WX₁₁X₁₂), wherein X₁ can be P or A, wherein X₂ can be Dor a conservative substitution of D, wherein X₄ can be K or asubstitution, wherein X₅ can be A or R or a conservative substitution ofA or R, wherein X₇ can be V or a conservative substitution of V, whereinX₉ can be H or a conservative substitution of H, wherein X₁₁ can be Q ora conservative substitution of Q and/or wherein X₁₁ can be Q or aconservative substitution of Q.

Also provided are the components for an immunoaffinity-based assay,which is suitable to detect a polypeptide of interest reliably,specifically and efficiently. This assay can be used for cellularimaging and or direct antigen detection.

Also provided is an immunoaffinity-based assay, which is suitable toreliably, specifically and efficiently purify a polypeptide of interest.

Furthermore, provided herein is a system for capture and/or detection,in particular a system that can be used for different types of analysis,for combined analysis by microscopic and biochemical studies, amongothers.

Further provided is a robust purification method for recombinantproteins that allows the use of non-denaturing or denaturing conditions.

DESCRIPTION OF THE FIGURES

FIG. 1 shows the structural analysis of BC2-nanobody (BC2-Nb) and aBC2-Nb/BC2T complex.

(a) shows the BC2-Nb/BC2T backbone interactions. The BC2-peptide (BC2T)folds into a β-strand that is part of a β-sheet structure formed by thecomplementarity determining region 3 (CDR3) and framework regions 2 and3 Shown are 13 backbone-backbone hydrogen bonds (black dashed lines) ofwhich one is mediated by water. The CDR3 contributes eight and theframework regions five of these interactions. The CDR1 and CDR2 are notparticipating in binding.

(b) shows the so called “Deadlock” interaction A charge-mediatedinteraction between Arg106 of the CDR3 and Glu44 in FR2 is stabilizingthe BC2-Nb BC2T complex. These amino acid side chains are reaching overthe peptide, forming a salt bridge that locks the peptide into itsbinding site.

(c) shows specific BC2-Nb/BC2T interactions. In addition to the backboneinteractions, a small number of interactions mediated by side chainsgenerate specificity for the BC2T peptide sequence BC2T residue W10 isinvolved in a CH-π interaction with Cys50 (dotted line). A watermolecule is bound by peptide amino acids S8, Tyr109, two carbonyl groupsand one amine group. The charge-mediated interaction between Arg106(CDR3) and the side chain of Glu44 (FR2), reaching over the peptide, isalso shown.

FIG. 2 shows identification of residues that mediate BC2T bindingspecificity. BC2T positional sequence variant libraries were incubatedwith BC2-Nb (black bars), BC2-Nb_(R106S) and BC2-Nb_(R106E) immobilizedon sepharose (n=3). After precipitation the supernatants were subjectedto liquid chromatography followed by mass spectrometry analysis.Specifically precipitated sequence variants were identified bycalculating the ratio of signals produced by the peptides in thesupernatant of BC2-Nb or indicated mutants and the supernatant of anon-BC2T-related control Nb (GFP specific Nb). Shown are results for thesequence variant libraries of (a) BC2T_(R3X), (b) BC2T_(A6X), (d)BC2T_(S8X), and (d) BC2T_(W10X) (see Table 1 for full results). Variedpositions are indicated as an X in the BC2T sequence. Amino acids aregrouped according to the physicochemical properties of their side chain:non-polar, polar, and charged. Gamma-amino butyric acid was used insteadof cysteine in the library synthesis Means and standard deviations(s.d.; error bars) of three independent experiments are shown.

FIG. 3 shows one-step purification of BC2-tagged proteins using the BC2nanotrap.

(a) For immunoprecipitation soluble proteins fractions of bacteriallysates either expressing GFP with a C-terminal BC2 tag (GFP_(BC2T)) orsolely GFP were incubated with the BC2-Nb immobilized on agarose (BC2nanotrap), Input (I), non-bound (NB) and bound fractions (B) wereseparated by SDS-PAGE and visualized either by Coomassie Blue (top) orby immunoblot analysis (bottom).

(b) The BC2 nanotrap efficiently binds its epitope under harshconditions. GFP_(BC2T) derived from bacterial extracts was incubated atincreasing concentrations of SDS (0-2%). Urea (0-4 M) or guanidinium bydrochloride (0-3 M) and precipitated as described in (a). Shown are theInput (I) and bound fractions at indicated conditions.

(c) BC2-tagged proteins are efficiently released using alkaline pH orpeptide elution. GFP_(BC2T) bound to the BC2 nanotrap was subjectedeither to elution with sodium thiocyanate (NaSCN, 1-3 M), acidic elution(0.2 M glycine pH 1-2.5), alkaline elution (pH 10-12) or peptide elution(0-1 mM). Aliquots of released (R) and bound (B) fractions were analyzedby immunoblotting with an anti-GFP antibody.

(d) The BC2 nanotrap binds BC2-tagged proteins from human cell lysatesirrespectively whether the BC2 tag is located at the N- or theC-terminus. For immunoprecipitation of BC2-tagged proteins solubleprotein fractions of HEK293T cells expressing _(BC2T)eGFP, eGFP_(BC2T)or solely eGFP were incubated with the BC2 nanotrap as described in (a).Input (I), non-bound (NB) and bound fractions (B) were separated bySDS-PAGE and visualized either by Coomassie Blue staining (top) orimmunoblot analysis (bottom).

FIG. 4 shows results of immunocytochemistry experiments using afluorescently labeled BC2 nanobody.

(a) shows a schematic representation of BC2-tagged fusion proteins andfluorescently labeled BC2-Nbs used for co-localization studies. The BC2Tsequence was genetically fused to the C-terminus of mCherry-Vimentin(mCherry-VIM_(BC2T)) and eGFP-PCNA (eGFP-PCNA_(BC2T))

(b) HeLa cells ectopically expressing mCherry-VIM_(BC2T) oreGFP-PCNA_(BC2T) were fixed either with methanol or PFA, respectively,followed by staining with the indicated fluorescently labeled BC2-Nbsand DAPI. Scale bar 25 μm.

FIG. 5 shows the structure of unliganded BC2-Nb and the correspondingBC2-Nb/BC2T complex.

(a) shows a ribbon drawing of the unliganded BC2-Nb structure. The fourcysteine residues forming two disulfide bonds are marked CDR3, whichcontributes contacts with the peptide in the liganded structure andundergoes an amino acid flip upon binding, is highlighted.

(b) shows a superposition of unliganded and liganded BC2-Nb structures.Comparison of the two structures reveals a flip of 180 degrees by twoamino acids. In the BC2-Nb structure. Arg106 is interacting with thecarbonyl group of Glu108 and Tyr107 is involved in a cationπ-interaction with Arg45. The β-carbon of Arg106 is orientated towardsand Tyr107 away from the Nb. In the peptide bound complex structure itis the other way around. Arg106 is involved in the “headlock” bindingand Tyr107 is forming a hydrogen bond to the carbonyl group of Arg104.

FIG. 6 shows a scheme of the interactions of the BC2-peptide with theBC2-Nb.

The BC2-peptide is shown with side chains. Its acetylated N-terminus isat the top and the amidated C-terminus is at the bottom. All polarinteractions within 3.5 Å are represented with dotted black lines, withtheir interaction partners from the BC2-Nb for backbone and for sidechain interactions. Water molecules are shown. Relevant hydrophobicinteractions within 4.0 Å are represented with dotted lines.

FIG. 7 shows that mutation of the headlock-motif leads to increasedoff-rates of the BC2 nanobody. For surface plasmon resonancespectroscopy (SPR)-based affinity measurements. GFP with a C-terminalBC2-tag (GFP_(BC2T)) (a-c) or solely GFP (d) was immobilized on aCM5-chip Kinetic measurements were performed by injecting sixconcentrations of BC2-Nb (a), BC2-Nb_(R106S) (b) or BC2-Nb_(R106E) (c)ranging from 8 nM-250 nM. The obtained data sets were evaluated usingthe 1:1 Langmuir binding model with mass transfer. As a control. BC2-Nbwas tested for binding to GFP only (d) The obtained affinities expressedin terms of the dissociation constant (K_(D)), and association (k_(on)))and dissociation rate constants (k_(off)) determined for BC2-Nb and thecorresponding mutants are summarized in Table 3.

FIG. 8 shows that proteins captured by the BC2 nanotrap can be elutedunder native conditions.

GFP_(BC2T) derived from bacterial lysate was bound by the BC2 nanotrapand subjected to elution using buffer conditions as described in FIG. 3c. The fluorescence signal intensity of released GFP_(BC2T) wasvisualized and quantified using a laser scanner Upper panel shows thefluorescent intensity of eluted GFP_(BC2T) in comparison to the completelysate. Lower panel: Quantification of GFP-fluorescence obtained fordifferent elution conditions. GFP-fluorescence obtained from untreatedlysate was set to 1. Means and s.d. (error bars) of three independentexperiments are shown (R1-R3).

FIG. 9 shows that BC2-Nb is functional for immunoprecipitation anddetection of BC2-lagged proteins.

(a) For immunoprecipitation soluble protein fractions of HEK293T cellseither expressing eGFP-PCNA (control) or BC2-tagged eGFP-PCNA(eGFP-PCNA_(BC2T)) (left panel) or mCherry-Vimentin (mCherry-VIM,control) or BC2-lagged mCherry-VIM (mCherry-VIM_(BC2T)) were incubatedwith the BC2 nanotrap. Input (I), non-bound (NB) and bound fractions (B)were separated by SDS-PAGE and visualized either by immunoblot analysisusing anti-PCNA or anti-Vimentin antibodies (upper panel). As loadingcontrol, blots were probed with an anti-GAPDH antibody (lower panel).

(b) For Western blot detection using fluorescently labeled B2-Nb(BC2-Nb_(AF488)) indicated amounts of the input fractions (as shown in(a)) were subjected to SDS-PAGE and immunoblotting. The Western blotswere probed with BC2-N₄₈₈ followed by detection with anti-PCNA (leftpanel) or anti-Vimentin (right panel) antibodies. As loading controlblots were probed with an anti-GAPDH antibody (lower panel).

(c) shows that the BC2 nanotrap precipitates only minor amounts ofendogenous catenin compared to overexpressed BC2-tagged proteins.Samples as described in (a) of FIG. 3d were subjected to immunoblotanalysis with an anti-β-catenin antibody. Arrows indicate β-cateninspecific signals.

FIG. 10 shows results obtained by immunocytochemistry usingfluorescently labeled BC2 nanobody.

(a) shows confocal imaging of mCherry-Vimentin_(BC2T) with theBC2-Nb_(AF488). HeLa cells ectopically expressing mCherry-VIM_(BC2T)were fixed with methanol, followed by staining with fluorescentlylabeled BC2-Nb and DAPI. Shown is a maximum projection image (z-stack of7 planes) of the cell in the lower left corner of the upper pane) inFIG. 4 b. Scale bar 10 μm.

(b) shows specificity of BC2-Nb_(AF488) and BC2-Nb_(ATTO647). HeLa cellsectopically expressing untagged mCherry-VIM or eGFP-PCNA were fixed withmethanol or PFA, respectively, and stained with the indicatedfluorescently labeled BC2-Nbs and DAPI. Scale bar 25 μm.

FIG. 11 shows a variation of the BC2 tag. The original BC2T sequence(PDRKAAVSHVVQQ: SEQ ID NO:4) was modified in 50 percent of allpositions, resulting in sequence PVRSAALSQWSS (BC2Tmut; SEQ ID NO:5).

Both the original and the modified version of the tag were C-terminallyfused to GFP (GFP_(BC2T); GFP_(BC2Tmut)). GFP without the BC2 tag wasused as negative control. The proteins were expressed in E. coli. Lysatewas prepared and 100 μg total lysate was immunoprecipitated with BC2nanobody immobilized on agarose. Input (I), non-bound (NB) and bound (B)fractions were analyzed by SDS-PAGE followed by Coomassie staining(upper panel) or immuno-blotting using an anti-GFP antibody (lowerpanel).

FIG. 12 shows length variations of the BC2 tag. The original BC2Tsequence (PDRKAAVSHWQQ; SEQ ID NO:4) was shortened, resulting insequence PDRKAAVSHW (BC2T-10; SEQ ID NO:14) and RKAAVSHW (BC2T-8; SEQ IDNO:3). The shortened and original versions of the tags were C-terminallyfused to GFP (GFP_(BC2T-10); GFP_(BC2T-8), GFP_(BC2T)). As negativecontrol, GFP without the BC2 tag was used. The proteins were expressedin E. coli. Lysate was prepared and 100 μg total lysate wasimmunoprecipitated with BC2 nanobody immobilized on agarose beads. Input(I), non-bound (NB) and bound (B) fractions were analyzed by SDS-PAGEfollowed by Western blotting using an anti-GFP antibody. The resultsshow that the BC2T-10 sequence (PDRKAAVSHW; SEQ ID NO:14) or the BC2T-8sequence (RKAAVSHW) (SEQ ID NO: 3) as C-terminal tag are efficientlyrecognized as the 12 amino acid of the original BC2T (PDRKAAVSHWQQ) (SEQID NO: 4) by the epitope specific BC2 nanobody.

FIG. 13 shows the results of a microsphere-based sandwich immunoassayusing β-catenin-specific Nbs as capture molecules Nbs were immobilizedon microspheres and incubated with increasing β-catenin concentrationsranging from 0.25 μg/ml to 2 μg/ml. Bound protein was detected with ananti-β-catenin antibody. Background level of control Nb was set to 1.Shown are mean signal intensities of three independent replicates ±s.d.Binding values of the five best Nbs are highlighted.

FIG. 14 shows that β-catenin-specific nanobodies show high bindingsensitivities. Nbs were covalently immobilized on microspheres andincubated with serial dilutions of β-catenin ranging from 0.2 ng/ml to 1μg/mL Bound protein was detected with an anti-β-catenin antibody. Shownare mean signal intensities of three independent replicates ±s.d.

FIG. 15 shows the results of affinity measurements of BC2-Nb. BC2-Nb wasused for surface plasmon resonance spectroscopy (SPR) measurementsagainst immobilized β-catenin. SPR sensograms of BC2-Nb are shown. TheNb injection time was 180 s, followed by a dissociation lime of 300 s.The data was evaluated using the software Bia evaluation 4.1 and the 1:1Langmuir binding model.

FIG. 16 shows the results of domain mapping of β-catenin binders.Microsphere-immobilized Nbs were incubated with GST-fusion constructscomprising full-length β-catenin or indicated domains. Capturedβ-catenin constructs were detected with domain-specific antibodies. Fordirect comparison, fluorescence intensities obtained withdomain-specific antibodies were normalized to signals obtained with ananti-GST-antibody.

FIG. 17 shows the results of epitope mapping of BC2-Nb.

-   -   (A) shows the identification of the epitope recognized by BC2.        In a peptide screen 29 overlapping 15-mer peptides covering the        N-terminus from aa 1-127 of β-catenin were immobilized on        microspheres with varying IDs per peptide and incubated with        biotinylated BC2-Nb. Peptide-bound BC2-Nb was detected with        streptavidin-phycoerythrin (PE) solution. The Myc-peptide        (EQKLISEEDL) (SEQ ID NO: 53) was used as negative control (neg        CTR).    -   (B) shows the determination of the minimal epitope of BC2-Nb.        Incubation of serial N- and C-terminally truncated peptides        covering aa 14-27 of β-catenin with biotinylated BC2-Nb        MEPDRKAAVSHWQQ (SEQ ID NO: 56), MEPDRKAAVSHWQ (SEQ ID NO: 57),        MEPDRKAAVSHW (SEQ ID NO: 58), MEPDRKAAVSH (SEQ ID NO: 59),        MEPDRKAAVS (SEQ ID NO: 60), EPDRKAAVSHWQQ (SEQ ID NO: 61),        PDRKAAVSHWQQ (SEQ ID NO: 4), DRKAAVSHWQQ (SEQ ID NO: 62),        RKAAVSHWQQ (SEQ ID NO: 63), KAAVSHWQQ (SEQ ID NO: 64), AAVSHWQQ        (SEQ ID NO: 65). Peptide-bound BC2 nanobodies were detected with        streptavidin-phycoerythrin (PE). Amino acid residues comprising        the minimal epitope of BC2-Nb are highlighted. Shown are mean        signal intensities of three independent replicates ±stds    -   (C) shows that phosphorylation of the epitope abolishes binding        by BC2-Nb. Binding analysis of BC2-Nb to a peptide representing        aa 15-29 with (+) or without (−) a phosphorylated Ser23 (phos        Ser23) EPDRKAASVSHWQQQS (SEQ ID NO: 66) was performed as        described in (B). Columns represent mean signal intensities of        three independent experiments ±stds.

FIG. 18 shows two variations of the BC2 tag with improved bindingaffinities over the wildtype BC2 tag. Biolayer interferometry(BLI)-based affinity measurements were used to assess two improvedvariations of the original BC2T sequence (a: PDRVRAVSHWSS, pTag1 SEQ IDNO:33, b. ADRVRAVSHWSS, pTag2 SEQ ID NO:34) in comparison with theoriginal BC2T sequence (c: BC2T SEQ ID NO:4) BC2T and improvedvariations thereof were synthesized as peptides with an N-terminalbiotin attached through an ethylene glycol linker and immobilised onStreptavidin biosensors. Binding kinetics were analysed by incubatingimmobilised peptides with 4-5 concentrations of BC2-Nb (5-50 nM) andevaluated using a 1:1 binding model. The obtained dissociation constants(K_(D)), association (k_(off)) and dissociation rate constants (k_(off))are summarised in Table 5.

FIG. 19 shows the application of two improved variations of the BC2 tagto protein purification and immunoprecipitation.

(a) Protein purification: The BC2 nanotrap was incubated with theprotein mCherry tagged N-terminally or C-terminally with the originalBC2 tag BC2T (SEQ ID NO:4) or improved variations thereof (pTag1 SEQ IDNO:33 and pTag2 SEQ ID NO:34). Subsequently, bound protein was elutedusing the corresponding free peptide at a concentration of 100 μM.Fractions of input (I), non-bound protein (NB), protein released bypeptide elution (R) and protein still bound after elution (B) wereanalysed using SDS-PAGE and Coomassie staining.

(b) Immunoprecipitation: The BC2 nanotrap was used to precipitate theprotein mCherry fused N-terminally with the BC2 tag variation pTag1 SEQID NO:33 from lysates of the human cell line HEK293T, the Trichoplusiani insect cell line High5 and the yeast Saccharomyces cerevisiae.Fractions of input (I) and bound protein (B) were analysed usingSDS-PAGE and Coomassie staining.

Definitions

In this description and the claims, reference will be made to a numberof terms which shall be defined to have the following meanings:

The term “epitope peptide” or “epitope tag” shall refer to a peptidesequence that is used as a tag; both terms are used interchangeably. Anepitope peptide is any peptide that comprises at least the 8 amino acidsequence as defined in SEQ ID NO: 1 (RX₄X₅AX₇SX₉W), wherein X₄ can be Kor a substitution: wherein X₅ can be A or R or a conservativesubstitution of A or R; wherein X₇ can be V or a conservativesubstitution of V, and wherein X₉ can be H or a conservativesubstitution of H. An epitope tag can also consist of SEQ ID NO: 1(RX₄X₅AX₇SX₉W), wherein X₄ can be K or a substitution; wherein X₅ can beA or R or a conservative substitution of A or R; wherein X₇ can be V ora conservative substitution of V, and wherein X₉ can be H or aconservative substitution of H. Isolated epitope peptides are alsoprovided wherein, when the isolated peptide comprises at least 9 aminoacids, the peptide does not comprise an amino acid sequence as definedby SEQ ID NO: 3 (RKAAVSHW) The epitope peptide or epitope tag can be anisolated sequence or can be part of a sequence.

The term “epitope specific antibody” shall refer to an antibody that isspecific to the epitope provided herein and can also be referred to astag specific antibody, or epitope tag specific antibody, these terms canbe used interchangeably.

The term “polypeptide” refers to any type of polypeptide or protein andcomprises any amino acid sequence wherein amino acids are connected viapeptide bonds, such as an amino acid sequence of at least 10 amino acidsin length, with the proviso that the construct of epitope tag andpolypeptide is not β-catenin lire term polypeptide is used in thisapplication very generally and shall comprise also peptides, such asoligopeptides. The terms polypeptide and protein are usedinterchangeably.

The term “polypeptide of interest” or “protein of interest” shall referto any type of polypeptide or protein having a function or property,such as a physiologically active molecule and comprises all structurevariants and sequence variants that have a desired property or function.In particular, the term shall encompass any type of peptide and proteinas well as to a fusion protein. Both terms aa used interchangeably.

The term “fusion protein” or “fusion polypeptide” shall refer to aprotein or a polypeptide created through the joining of two or morepolypeptides or at least one polypeptide and at least one peptide or atleast one polypeptide and at least one oligopeptide. For example, a“fusion protein” can refer to a polypeptide comprising the amino acidsequence of a protein fused with the amino acid sequence of an epitopepeptide. The fusion protein or fusion polypeptide can comprise one ormore than one polypeptide and one or more than one epitope peptide.Moreover, the fused peptides can be joined via a bridge or linkersequence. The epitope peptide is preferably fused to the N- orC-terminal end of the amino acid sequence of the protein, eitherdirectly or via a linker molecule.

A linker is any unit that connects two parts of a molecule, such as twonucleic acids, two peptides or proteins, a tag and a protein ofinterest, or a nucleotide, peptide or protein and a carrier. Any unitthat is known for this purpose can be used as long as it does notinterfere with the function of the molecule. The linker can be comprisedof nucleotides when the connected members are nucleic acids, for example3 to 150 nucleotides, or can be one amino acid or an amino acid sequencehaving 2 to 50 amino acids, when at least one of the two parts is apeptide or protein, or another bridge molecule.

A variant of an epitope peptide of the present invention is a peptide,comprising at least 8 amino acids that on positions 1, 4, 6, and 8 hasamino acids 1, 4, 6 and 8 of SEQ ID NO:3, whereas the remaining aminoacids can be amino acids of SEQ ID NO:3 and at least one of those isanother amino acid that regarding size, polarity and/or charge, issimilar to the “original” amino acid of SEQ ID NO:3. In other words, theterm “variant” as used in this description refers to a sequence that isderived from or corresponds to SEQ ID NO:3 wherein at least 12.5% and upto 25, or 37.5, or 50% of the amino acids are substituted bysubstitutions, like conservative substitutions. For example, a lysinecan also be substituted by a serine without changing the function of thepeptide. A variant of the present invention can be an epitope peptide asdefined above wherein 4 out of 8 amino acids of the sequence of SEQ IDNO:3 can be substituted without substantially changing the function ofthe peptide. A variant of an epitope peptide of the present inventionhaving 8 amino acid residues comprising 3 conservative substitutions,resulting in a 37.5% conservatively substituted variant has beat shownto be active in the examples.

The epitope tag of the present invention being located at the C-terminalor N-terminal end means that no other amino acid residues derived fromthe sequence of the polypeptide of interest are preceding the sequenceof the epitope tag at the N-terminal end or following the sequence ofthe epitope tag at the C-terminal end. Sequences not derived from theoriginal sequence of the polypeptide of interests, such as further tagsor linker sequences, can however fed low or precede the C- or N-terminaltag of the present invention.

The terms “purification” or “purify” when used in the presentapplication shall refer to any type of physical separation of a chemicalor biochemical entity of interest from a material, such as a biochemicalmaterial. An example is the purification of a polypeptide from abiochemical mass like a tissue or broth Purified entities can also bereferred to as isolate Protein or polypeptide purification can compriseone step or a series of steps intended to isolate or enrich one or a fewproteins from a sample. Purification is vital for the characterizationof the function, structure and interactions of the protein of interest.Purification can comprise to separate protein from non-protein parts ofa sample or to separate a desired protein from other proteins. Theresult of purification can be an isolated protein or a sample whereinthe polypeptide is enriched. In the present application the term“purification” encompasses all types of separation and encompassesisolation as well as enrichment.

The term “sample” refers to a specimen or a quantity that is used forany type of determination or purification method. The sample comprises atag and/or protein of interest, it can be a liquid or a solid. In thecontext of the present invention a sample can be often a complex mixturecomprising the protein of interest, such as a cell, a tissue or celllysate. A sample can also be a supernatant obtained by centrifugation ofa liquid comprising a cell, where the cell is capable of secreting theprotein of interest into the liquid. A sample can also be a specimen ofany type of body fluid like blood, urine, liquor, sweat, tears, etc.

The term “secondary binding partner” refers to compounds that canspecifically or unspecifically bind to a lagged polypeptide, an epitopespecific affinity ligand such as an antibody or a complex formed by bothas a primary binding partner, either directly or via a unit provided atthe polypeptide or the affinity ligand such as an antibody. One examplefor a binding pair is a constant part of an epitope specific antibodyand an Fc specific antibody. Another example for a binding pair is abiotin moiety present on the tagged polypeptide or the epitope specificantibody and a streptavidin. A further example as secondary bindingpartner is an antibody against the complex. The secondary bindingpartner can be used for isolating the complex and/or for detecting thecomplex. For this purpose the secondary binding partner can carryimmobilizable groups, delectable groups, etc.

DETAILED DESCRIPTION

Provided herein is an epitope peptide sequence, which is defined as SEQID NO:1. It has been surprisingly found that a peptide sequencecomprising at least the eight amino acids of SEQ ID NO:3 (RKAAVSHW) orvariants thereof are useful as an epitope tag. This amino acid sequenceis sufficient for specific binding as the affinity of its antibody tothe epitope is high enough to ensure specific and reliable interactionwith the tag. Furthermore, it has been found that a system for captureand/or detection based on the epitope peptide has very desirableproperties. For example it has been shown that epitope-lagged proteinsare bound by an epitope specific antibody with a K_(D) of ˜1.4 nM whichtranslates to a ˜10-100 fold higher affinity compared to the systemsavailable in the prior art, such as FLAG, HA, c-myc or thenanobody-derived EPEA-systems. The epitope peptide provided herein isvery versatile; it can be used as C-terminal or N-terminal tag and showsstrong binding to the corresponding antibody under mild or harshconditions. One or more tags can be used, which can be the same ordifferent as long as at least one of the tags is an epitope tag providedherein, for example and not to be limiting, an epitope tag consisting ofor comprising SEQ ID NO: 3 or SEQ ID NO: 3 with one or moresubstitutions, and which can be located on different ends of thepolypeptide or which can be arranged tandem-like or in any other order.In particular the system for detection and/or capture shows unusuallystrong affinity and binding efficiency because of its unique bindingcharacteristics.

Moreover, it has been surprisingly found, that only four amino acids ofSEQ ID NO: 3 (R at position 1, A at position 4. S at position 6, and Wat position 8 with reference to the amino acid residues in SEQ ID NO:3),are essential for specific binding of a tag specific antibody and thatthe remaining positions can be substituted such as by another amino acidthat regarding size, polarity and or charge is similar to the “original”amino acid, and preferably is substituted conservatively, withoutsignificantly negatively impacting the binding specificity and affinit.For example, these substitutions can be one or more non-naturallyoccurring amino acid substitutions. It has been found that an epitopetag comprising, preferably conservative, substitutions at up to 50% ofthe amino acids comprised in the polypeptide as defined in SEQ ID NO:4still ensures efficient and reliable interaction of the epitope specificantibody with the lugged proteins. It has also been found that peptideshaving additional amino acids on one or both ends provide for efficientand reliable interaction with a epitope specific antibody. As anexample, a modified tag or epitope peptide having the amino acidsequence PVRSAALSQWSS (SEQ ID NO:5) was used to efficiently purifytagged GFP protein (see FIG. 11). Further examples of a modified tag orepitope peptide consist of or comprise an amino acid sequence as definedby SEQ ID NO:33 (PDRVRAVSHWSS), or consist of or comprise an amino acidsequence as defined by SEQ ID NO: 34 (ADRVRAVSHWSS).

It has been found that the epitope tag provided herein can comprisefurther amino acids flanking the central tag sequence. Although tagsequences with a length of more than 25 amino acids could be used, it isbeneficial to provide a smaller tag in order to minimize interference ofthe tag with the subcellular locations, interference with folding, orinterference with functions of the polypeptides or proteins fused withthe tag. Thus, the epitope peptide comprising the tag can be an epitopepeptide that does not comprise more than 25 amino acids. For example,the epitope peptide can comprise about 8, 9, 10, 11, 12, 13, 14, 15, 16,17, 18, 19, 20, 21, 22, 23, 24 or 25 amino acids or consist of about 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25amino acids. The epitope peptide can also consist of 25 amino acids orless.

For example, and not to be limiting, the epitope peptide or tag consistsof or comprises from about 8 to about 25 amino acids, wherein the aminoacid sequence consists of or comprises a sequence as defined in SEQ IDNO: 1 (RX₄X₅AX₇SX₉W),

wherein X₄ can be K or a substitution:

wherein X₅ can be A or a conservative substitution of A;

wherein X₇ can be V or a conservative substitution of V, and

wherein X₉ can be H or a conservative substitution of H.

In some examples X₄ can be a conservative substitution of K or S.

In another example, the epitope peptide or tag consists of or comprisesfrom about 8 to about 25 amino acids, wherein the amino acid sequenceconsists of or comprises a sequence as defined in SEQ ID NO: 1(RX₄X₅AX₇SX₉W).

wherein X₄ can be K or a substitution;

wherein X₅ can be A or a conservative substitution of A:

wherein X₇ can be V or a conservative substitution of V, and

wherein X₉ can be H or a conservative substitution of H, wherein, whenthe isolated epitope peptide comprises at least 9 amino acids, it doesnot comprise an amino acid sequence as defined by SEQ ID NO:3(RKAAVSHW). In some examples, X₄ can be a conservative substitution of Kor S.

In any of the epitope peptides described herein, one or more of theamino acid substitutions can be a non-naturally occurring substitution.It is understood that any of the epitope peptides provided herein cancomprise one or more substitutions described herein. For example, theepitope peptide can consist of or comprises a sequence as defined in SEQID NO:1 (RX₄X₅AX₇SX₉W), with one or more substitutions at positions X₄,X₅, X₇ and X₉ as described herein. In another example, the epitopepeptide can consist of or comprises a sequence as defined in SEQ IDNO:32 (X₁X₂RX₄X₅AX₇SX₉WX₁₁X₁₂), with one or more substitutions atpositions X₁, X₂, X₄, X₅, X₇, X₉, X₁₁, or X₁₂, as described herein Inanother example, the epitope peptide consists of or comprises from about12 to 25 amino acids, wherein the amino acid sequence consists of orcomprises a sequence as defined in SEQ ID NO:32 (X₁X₂RX₄X₅AX₇SX₉WX₁₁X₁₂)

wherein X₁ can be P or A;

wherein X₂ can be D or a conservative substitution of D;

wherein X₄ can be K, or a substitution;

wherein X₅ can be A or R, or a conservative substitution of A or R;

wherein X₆ can be V or a conservative substitution of V;

wherein X₇ can be H or a conservative substitution of H; and

wherein X₁₁ and X₁₂ can be Q or a conservative substitution of Q.

In some examples, X₄ can be a conservative substitution of K or S.

In another example, the epitope peptide consists of or comprises fromabout 12 to 25 amino acids, wherein the amino acid sequence consists ofor comprises a sequence as defined in SEQ ID NO:32(X₁X₂RX₄X₅AX₇SX₉WX₁₁X₁₂)

wherein X₁ can be P or A;

wherein X₂ can be D or a conservative substitution of D;

wherein X₄ can be K or a substitution:

wherein X₅ can be A or R, or a conservative substitution of A or R;

wherein X₇ can be V or a conservative substitution of V;

wherein X₅ can be H or a conservative substitution of H; and

wherein X₁₁ and X₁₂ can be Q or a conservative substitution of Q,wherein the isolated epitope peptide dews not comprise an amino acidsequence as defined by SEQ ID NO:3 (RKAAVSHW). In some examples, X₄ canbe a conservative substitution of K or S.

One or more of the amino acid substitutions can be a non-naturallyoccurring substitution.

In another example, the epitope peptide consists of or comprises anamino acid sequence as defined by SEQ ID NO:4 (PDRKAAVSHWQQ) or avariant thereof as defined above.

A “conservative substitution” refers to the substitution of one aminoacid by another, wherein the replacement results in a silent alteration.This means that one or more amino acid residues within the epitopepeptide sequence can be substituted by another amino acid of a similarpolarity which acts as a functional equivalent. Substitutes for an aminoacid within the sequence may be selected from other members of the classto which the amino acid belongs (i.e. a conservative substitution). Forexample, one polar amino acid can be substituted by another polar aminoacid, one positively or negatively charged amino acid, respectively, canbe substituted by another positively or negatively charged amino acid,respectively, et cetera. Classes of amino acids are for example,nonpolar (hydrophobic) amino acids including alanine (A), leucine (L),isoleucine (I), valine (V), proline (P), phenylalanine (F), try ptophan(W) and methionine (M); polar neutral amino acids including glycine (G),serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N),and glutamine (Q); positively charged (basic) amino acids includingarginine (R), lysine (K) and histidine (H); negatively charged (acidic)amino acids including aspartic acid (D) and glutamic acid (E).

The epitope peptide is provided in isolated form, which means that isnot comprised within the body of an animal or a human being. The epitopepeptide can be provided as isolated peptide or it can be used inconjugated, linked, bound, or connected form, like in a fusion proteinwherein at least one epitope is fused to a polypeptide, in particular aprotein or polypeptide of interest, or in a construct which comprises atleast one epitope provided herein and a polypeptide, optionally linkedor connected by a linker, spacer or other connecting member. The linker,spacer or other connecting member can be any sequence as is known in theart. It can for example be a sequence providing for a cleavage site, orcan just be a spacer, i.e. a member that adjusts the distance betweenepitope and polypeptide, or a connecting member contributing a desirablefunction.

It has been surprisingly found that this epitope peptide can beefficiently find reliably used as an epitope tag, which can interactspecifically and efficiently with an epitope specific or tag specificantibody.

“Epitope lagging” is a technique in which a known epitope is fused to apolypeptide, such as a recombinant protein, for example by means ofgenetic engineering. An epitope tag can be used in combination with anantibody specific for this epitope tag for detecting polypeptides suchas proteins of interest. To be useful as an epitope tag a peptide shouldprovide a region for selective binding of a specific antibody, the tagregion should be available for binding even if the tag is linked to orfused with a polypeptide and unspecific binding should be avoided as faras possible. The binding between the epitope tag and the correspondingantibody should be strong, reliable and selective. By choosing asuitable combination of an epitope tag and a corresponding antibody, thetechnique makes it possible to detect proteins for which no antibody isavailable. This is especially useful for the characterization of newlydiscovered proteins and proteins of low immunogenicity.

Also provided is a system comprising a pair of an epitope and anantibody, that can be used for a number of experimental applications,such as Western blot analysis, immunoprecipitation,immuno(histo)chemistry, immunofluorescence studies, protein-proteininteraction studies, ELISA, and affinity purification.

Despite being derived from an endogenous protein, β-catenin, it has beensurprisingly found that proteins fused with the tag, are specificallyand reliably detected with a epitope specific antibody withoutsignificant influence from any interaction of the antibody withendogenous protein Without being held to this theory it is contemplatedthat this is due to the endogenous β-catenin being pan of a proteincomplex forming the so-called adherens junctions, and therefore notbeing available for an efficient interaction with the epitope specificantibody. Furthermore it has been found that the epitope specificantibody BC2-Nb has an increased affinity to the epitope tag (1.4 nM)compared to the affinity of the BC2-Nb to endogenous β-catenin (3.7 nM),suggesting that binding of the antibody to tagged polypeptides isfavored to binding of the antibody to endogenous β-catenin.

Polypeptides, such as polypeptides or proteins of interest, which havebeen fused or are linked with or are connected with the epitope tag canbe specifically and reliably detected using a tag-specific antibody, asfor example described below in detail. Tagged polypeptides can also bespecifically, efficiently and reliably purified by an immunoaffinitybased capture assay.

Also provided herein is a construct comprising a polypeptide and atleast one epitope peptide, as defined above, at the N-terminal orC-terminal end of the polypeptide. In the constructs provided herein,the epitope peptide can consist of or comprise any epitope peptide aminoacid sequence set forth herein, for example, SEQ ID NO: 1 SEQ ID NO: 3,SEQ ID NO: 4. SEQ ID NO: 5 SEQ ID NO: 32. SEQ ID NO: 33 or SEQ ID NO: 34as defined herein. Therefore, any polypeptide provided herein, includinga polpeptide in a construct, can have at least one epitope peptideconsisting of or comprising. SEQ ID NO: 1. SEQ ID NO: 3. SEQ ID NO: 4,SEQ ID NO: 5 SEQ ID NO: 32. SEQ ID NO: 33 or SEQ ID NO: 34, as definedherein, at the N-terminal or C-terminal of the polypeptide. The term“construct consisting of or comprising a polypeptide and at least oneepitope peptide” refers to a non-naturally occurring polypeptide, whichcan also be called a “fusion protein”. The polypeptide comprised in theconstruct can comprise at least a protein of interest and optionally alinking sequence, it can also comprise further functional sequencesand/or further tags. The construct can have one or more epitope tagsprovided herein. If the construct includes more than one tag, those canbe arranged tandem-like, can be consecutive, can be spaced or can be atopposite ends. The construct in addition can comprise further tags whichcan be at tiny useful position.

Also provided is a nucleic acid encoding the epitope peptide providedherein. The nucleic acid can be RNA, DNA, PNA, or LNA.

Nucleic acid constructs comprise a nucleic acid sequence encoding atleast one epitope peptide provided herein. These nucleic acid constructscan be of prokaryotic or eukaryotic origin, such as of bacterial,mammalian, yeast, fungal, nematode, fish, avian, viral, or insectorigin. Optionally, the nucleic acid construct also comprises thenucleic acid sequence of the polypeptide to be tagged, such as theprotein of interest. The nucleic acid sequence encoding the epitopepeptide can be downstream and/or upstream of the nucleic acid sequenceencoding the polypeptide to be tagged resulting in a C-terminally and orN-terminally tagged polypeptide. It has been surprisingly found, thatthe epitope peptide can be used C-terminally as well as N-terminally.This allows very versatile use of the epitope peptide or the nucleicacid coding it, respectively. The construct can comprise a nucleic acidcoding for one epitope peptide or for more copies of the epitopepeptide, such as two or three copies. The more than one copy can beadjacent to the terminal tag or tags can be at the N- and C-terminalsrespectively.

The construct can also be a nucleic acid expression construct, andcomprises at least one nucleic acid encoding an epitope tag and at leastone nucleic acid encoding a polypeptide to be tagged, wherein both canbe positioned directly adjacent to each other, or can be separated by alinker sequence of an appropriate length, such as about 3-150nucleobases. Other functional sequences can also be included. Forexample, the linker can comprise a recognition site for an endonucleaseor a nucleic acid sequence that encodes a recognition site for aprotease. Examples for proteases are TEV protease, thrombin, factor Xaprotease, or a PreScission protease.

The nucleic acid construct or nucleic acid expression construct canfurther comprise nucleic acid sequences, which encode further tags, suchas a HIS-tag.

The nucleic acid construct or nucleic acid expression construct cancomprise a nucleic acid sequence, which encodes more than one tag, forexample a tandem tag. A tandem tag is the combination of two tags insequence. A tandem tag can comprise at least two copies of an epitopetag provided herein, or at least one epitope tag provided herein and atleast one other tag. The construct can also comprise a number of same ordifferent tags if appropriate.

Further provided is a host cell comprising a nucleic acid or nucleicacid expression construct provided herein. The “host cell” is a cellthat comprises the nucleic acid sequences encoding the epitope tagprovided herein. The host cell can be a stable transfectant or it can betransiently transfected with the nucleic acid construct comprising anucleic acid sequence(s) encoding the epitope tag. The host cell can bea prokaryotic or a eukaryotic cell. For example the host cell can be abacterial, yeast, insect, mammalian, plant, fungal, nematode, fish, oravian cell. The cell can be a primary cell or a cell line. The host cellcan be an individual single cell, or can be a cell that is pan of atissue.

Also provided is a method for introducing the nucleic acid or thenucleic acid expression construct into a host cell. The introductionresults in the nucleic acid encoding at least one epitope tag providedherein being connected to a nucleic acid encoding at least a polypeptideof interest in the host cell. Therefore, the host cell comprises thenucleic acid encoding at least one epitope tag provided herein and will,upon expression, comprise the epitope tag.

The method for introducing the nucleic acid into the host can be anymethod known in the art for this purpose, in particular the method canbe selected from the group comprising CRISPR/Cas genome editing, genomeediting methods using Zinc finger nucleases (ZFNs),transcription-activator like effector nucleases (TALFNs), ormeganucleases, reagent-based methods using reagents such as cationiclipids, calcium phosphate, or DEAE-dextran, transduction, transfection,and instrument-based methods such as electroporation, microinjection andlaserfection.

Any of the epitope peptides provided herein can be used as an N-terminalor C-terminal epitope tag. Host cells that comprise the nucleic acidencoding the epitope tag will express a polypeptide, such as apolypeptide protein of interest, which is linked with, or fused to theepitope tag. Depending on the sequence of the nucleic acid expressionconstruct and or on the method of introducing the nucleic acid encodingthe epitope tag, the introduction into the host will result in anN-terminally or C-terminally tagged polypeptide C-terminal or N-terminaltags are preferred compared to internal tags, as the terminal tags willbe more easily accessible for interaction with a epitope specificantibody. The epitope peptide of the present invention can however alsobe used as an internal tag, provided that the tagged protein of interestexhibits a conformation, wherein the internal tag is accessible forinteraction with a binding partner. For example, the protein of interestcan exhibit loop structures, wherein the internal tag is part of thesequence forming the loop structure. In this case, the internal tag willbe accessible for interaction with binding partners.

As used throughout, the term “affinity ligand” refers to molecules thatare capable of binding with very high affinity to either a moietyspecific for it or to an antibody raised against it. Examples includebiotin (ligand)-streptavidin (moiety), digoxigenin(ligand)-anti-D1G-antibody and further tag specific antibodies. As usedherein, the term “antibody” comprises monoclonal antibodies, polyclonalantibodies, particularly polyclonal monospecific antibodies (i.e.antibodies with different variable regions, which however all recognizethe specific epitope tag provided herein), chimeric antibodies, as wellas a fragment or variant of the above listed types of antibodies. Theterm “antibody” herein furthermore comprises genetically manipulatedantibodies, and in a nonlimiting example, the term “antibody” refers tothe heavy chain antibodies such as found in Camelidae, for example, in acamel or a llama. The binding elements of these antibodies consist of asingle polypeptide domain, namely the variable region of the heavy drainpolypeptide (VHH). These antibodies are naturally devoid of light-chainswith the heavy chain variable domain forming the completeantigen-binding site. In contrast, conventional antibodies have bindingelements comprising two polypeptide domains (the variable regions of theheavy chain (VH) and the light chain (VL)). The lack of dependence oninteraction with a light chain variable domain for maintainingstructural and functional integrity gives these VHH domains asubstantial advantage over other small antibody fragments, in terms ofease of production and behavior in solution. In particular, VHHfragments are the preferred types of molecules for immuno-affinitypurification, because of their unusual stability and their ability torefold efficiently after complete denaturation, which frequently occursduring elution of antigen. Heavy chain antibodies are also calledsingle-domain antibodies or single-chain antibodies. Fragments of heavychain antibodies are also termed “nanobodies”. Fragments of antibodiesare well-known in the art and any fragment that has epitope bindingactivity can be used as an “antibody” in the systems or for theinteraction provided herein.

The term “antibody” as used herein, typically refers to full-lengthantibodies and to antibody fragments of the aforementioned antibodies aswell as variants as defined below. Antibodies, that do not contain allthe domains or regions of a full-length antibody, are fragments ofantibodies which are also provided herein. Thus, the term “antibody”shall encompass any type of antibody, fragments and variants thereof,and mixtures of antibodies, fragments, and/or variants.

Any antibody that has affinity for and is specific for an epitopepeptide provided herein and that provides for a high avidity or affinitycan be used. It can be a conventional antibody or a heavy chainantibody, or a fragment of a conventional antibody or of a camelidantibody, it can also be a mixture of antibodies, fragments and/orvariants. Preferred antibodies or antibody fragments are derived fromheavy chain antibodies, such as camelid antibodies, and camelid antibodyfragments, which are called nanobodies or Nb.

All of the afore mentioned antibodies may be present in bound or solubleform and may comprise a detectable moiety, or “label” (for examplefluorescence markers, radioactive isotopes, colloidal gold marker,coupled enzymes, etc.), and or may carry a peptide, group or linker forimmobilization on a solid phase. The term “bound” refers to bothmembrane-bound antibodies and antibodies immobilized to a solid supportor carrier material. The term “soluble” in the context of antibodiesrefers to antibodies that are not bound to a membrane or solid support,as is well understood by the skilled person.

Further provided is a construct comprising or consisting of apolypeptide and at least one epitope peptide at its N-terminal orC-terminal end as defined above and in the claims. The polypeptide canbe any polypeptide as outlined above and can for example at leastcomprise a protein of interest and optionally further functionalsequences, for example further tags. The construct can include one ormore epitope tags, as explained above.

This construct or tagged polypeptide can be captured and/or detected bymeasuring the interaction of a tag specific affinity ligand, such as atag specific antibody, which can comprise a detectable moiety and/or canbe bound or bindable to a solid support, with the tagged polypeptide.The construct or tagged polypeptide of interest can be captured with thetag specific affinity ligand, such as the tag specific antibody forpurification or enrichment, and the tagged polypeptide optionally canthen be detected and/or quantified by known means, such as a secondarydetection ligand, which preferably comprises a detectable moiety, suchas a fluorescent label, and interacts with the polypeptide of interest,or, preferably, with the tag specific affinity ligand, such as the tagspecific antibody.

For detection the tagged polypeptide and or the tag specific affinityligand, such as the specific antibody can comprise a detectable moietyor a moiety that allows introducing a detectable moiety or a moietyproducing some kind of signal. Detectable moieties and methods fordetecting and/or quantifying a moiety signal are well-known in the art.It is also possible to enrich the polypeptide of interest and to lateranalyze binding partners, amount and other properties of interest of thetagged polypeptide.

Therefore, provided herein is a method for detection and/or capture, forexample a method for purifying a polypeptide by capture and/or a methodfor detecting a polypeptide by capturing and/or measuring a detectablesignal, or by enriching a polypeptide and analyzing the polypeptide inthe obtained composition.

The detection methods provided herein are suitable to determine thepresence, subcellular localization and/or amount of a polypeptidecomprising the epitope peptide or epitope tag provided herein. Themethod can be an in vivo, or an in vitro method.

The tagged polypeptide or protein to be detected, located and/orquantified can be detected at its intracellular location in a host cell,for example in the cell nucleus, in cell membranes or another cellcompartment. The tagged polypeptide or protein to be detected, and/or tobe quantified can also be detected in a solution comprising the laggedpolypeptide or protein, for example a cell lysate obtained from a hostcell, or a tissue comprising the host cell.

In a first step of a detection method affinity ligand, such as anantibody specifically binding the epitope peptide is administered to asample comprising the tagged polypeptide or protein. The sample can be ahost cell, a tissue, a solution comprising cell lysate of a host cell orany other sample that comprises the tagged polypeptide, such as asupernatant, for example, obtained after centrifugation of a liquidcomprising the host cell, wherein the host cell is capable of secretingthe polypeptide of interest into the liquid or another specimen like abody fluid.

This administration step is carried out at conditions that allowspecific interaction of the affinity ligand, such as the antibody and usepitope tag. Such conditions are well known to the person of skill inthe art. Washing steps typically follow the administration of anantibody to its antigen, and the skilled person knows how and when toapply said washing steps.

In one example, the tagged polypeptide or protein is detected,quantified and or located by detecting the interaction of the affinityligand, such as the antibody and its epitope tag. The detection can bedone as is known in the art. In another example, at least one or bothinteracting partners, usually the epitope specific affinity ligand, suchas the antibody, comprises a detectable moiety.

Upon administration to the sample, such as a host cell or solutioncomprising cell lysate, or cell supernatant, the affinity ligand, suchas an antibody, will specifically interact with the lagged protein. Thisinteraction can be detected, monitored and quantified by measuring orobserving the reporter signal obtained from the detectable moiety. Forexample, if the detectable moiety is a fluorescent label, fluorescencecan be measured and observed upon excitation.

In another example, the detection method is carried out by using:

an epitope specific affinity ligand, such as an antibody, which does notcomprise a detectable moiety;

at least one secondary binding partner, which can bind to the epitopespecific affinity ligand, such as the antibody, the tagged protein, orto the complex built by both. The secondary binding partner can comprisea detectable moiety and/or can be immobilizable.

In a first step, an epitope specific affinity ligand, such as theantibody is administered to a sample, such as a host cell or a solutioncomprising a host cell lysate as disclosed above. In a second step, asecondary binding partner can be administered to the sample, such as thehost cell or a solution comprising a host cell lysate comprising thetagged polypeptide or protein bound to the epitope specific affinityligand, such as the antibody.

Presence, amount and or localization of the tagged polypeptide orprotein can be detected or determined by measuring or observing areporter signal obtained from a delectable moiety comprised in thesecondary binding partner.

The advantage of a two-step detection method using two types of bindingpartners is that the tag specific interaction is separated from theactual detection step. This allows the tag specific affinity ligand,such as the antibody, to remain unchanged, as it does not need tocomprise a detectable moiety. This could enhance its specificity oraffinity compared to a tag specific affinity ligand, such as anantibody, comprising a detectable moiety, as the detectable moiety couldinfluence the interaction of the tag and its affinity ligand, such as anantibody Thus, the reliability and efficiency of the detection methodcould be enhanced as well. Furthermore, using the tag specific affinityligand, such as an antibody, simply as a capture affinity ligand, suchas a capture antibody and not as a capture and detection affinityligand, such as a capture and detection antibody allows separation ofthe capture and the detection steps if only presence and amount of thetagged protein or polypeptide is to be determined. Therefore, the firststep using the epitope tag specific affinity ligand, such as a tagspecific antibody could be followed by an isolation or enrichment step,yielding the captured tagged protein or polypeptide of interestIsolation or purification steps are discussed further below. Thedetection step could then be carried out on the isolated and or enrichedtagged protein, leading to enhanced reliability of the obtainedquantification and an easier handling of the detection step.

Suitable biophysical or biomolecular detection methods for qualitativelydetecting the epitope tag/antibody interaction comprise any suitablemethod known in the an. Such methods include, without being limitedthereto, methods as applied for qualitative or quantitative assays, e.g.for Enzyme-linked Immunosorbent Assay (ELISA), ELISPOT assay. WesternBlot or immunoassays. Such methods comprise e.g. optical, radioactive orchromatographic methods, preferably when using any of the above labels,markers or linkers, more preferably fluorescence detection methods,radioactivity detection methods, Coomassie-Blue staining, silverstaining or other protein staining methods, electron microscopy methods,methods for staining tissue sections by immunohistochemistry or bydirect or indirect immunofluorescence, etc. Such methods may be appliedcither with the antibody or may involve the use of further tools, e.g.the use of a secondary binding partner, specifically binding to a partof the tagged polypeptide, the antibody, or the complex.

Depending on the size of the used antibody, the subcellular localizationof the tagged polypeptide or protein of interest can also be determined.For example, if the detection antibody is small enough, distinctsubcellular structures such as the intermediate filamentous network oran essential part of the replication machinery can be visualized andmonitored.

For example, the detection antibody can be a nanobody, becausenanobodies can interact with proteins at subcellular localizations, suchas structures deeply embedded in chromatin, due to their decreased sizecompared to conventional antibodies. Due to the absence of the variablelight chain, nanobodies only possess three hypervariable loops(complementarity determining regions, CDRs) compared to six CDRs presentin conventional antibodies. The three CDRs are flanked by 4 frameworkregions (FR) To compensate for the loss of CDRs. Nbs exhibit distinctivefeatures regarding their CDR structure and antigen binding mode. Inorder to provide a sufficiently large antigen-interacting surface of600-800 Å [1], most nanobodies exhibit substantially elongated CDR3loops [2]. In some cases, the increased flexibility of such long loopsis counteracted by fastening them to the Nb core with an additionaldisulfide bond [3].

A typical nanobody thus can be schematically displayed as an amino acidsequence comprising the following regions:

FR1-CDR1-FR2-CDR2-FR3-CDR3-FR4

An example of a nanobody provided herein is the BC2 nanobody (BC2-Nb)that shows favorable properties, such as very high affinity, strongbinding, and versatility. The BC2-Nb comprises the variable region of aheavy chain antibody of Camelidae composed of framework region 1, CDR1,framework region 2, CDR2, framework region 3, CDR3, and framework region4, as defined by the following amino acid sequence (122 aa):

QVQLVESGGGLVQPGGSLTLSCTASGFTLDHYDIGWFRQAPGKEREGVSCINNSDDDTYYADSVKGRFTIFMDNAKDTVYLQMNSLKPEDTAIYYCAEARGCKRGRYEYDFWGQGTQVTVSS (SEQ ID NO: 6; bold type designates the CDRs)

Table 1 provides the amino acid sequences of FR1, CDR1, FR2, CDR2, FR3,CDR3 and FR4 of the BC2 nanobody. A nucleic acid sequence encoding SEQID NO: 7, SEQ ID NO 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQID NO: 12 and or SEQ ID NO: 13 is also provided herein.

In one example, the epitope specific antibody provided herein comprisesthe amino acid sequence defined in SEQ ID NO: 6, or comprises an aminoacid sequence dial has at least about 90, 95, or 99% identiy to SEQ IDNO:6. For example, the antibody can comprise an amino acid sequence thathas at least about 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% or anypercentage identity in between these percentages. A nucleic acidencoding SEQ ID NO: 6 or an amino acid sequence that has at least about90, 95, or 99% identity to SEQ ID NO: 6 is also provided herein. Theepitope specific antibodies provided herein exhibit a comparableaffinity to the epitope peptide provided herein compared to the BC2-Nbdefined by SEQ ID NO:6 and are termed “functional variants” of theBC2-Nb nanobody provided herein In other words, provided herein areepitope specific antibodies that have the same epitope specificity asBC2-Nb. Also provided are antibodies having the same epitope specificityas an antibody having complementarity determining regions (CDRs)comprising amino acid sequences SEQ ID NO: 8, SEQ ID NO: 10 and or SEQID NO: 12.

TABLE 1 Regions of amino acid sequence defining BC2-Nb: Region SequencePosition FR1 QVQLVESGGGLVQPGGSLTLSCTAS  1-25 (SEQ ID NO: 7) CDR1GFTLDHYD 26-33 (SEQ ID NO: 8) FR2 IGWFRQAPGKEREGVSC 34-50 (SEQ ID NO: 9)CDR2 INNSDDDTY 51-59 (SEQ ID NO: 10) FR3 YADSVKGRFTIFMDNAKDTVYLQMNSLKPED60-98 TAIYYCAE (SEQ ID NO: 11) CDR3 ARGCKRGRYEYDFW  99-112(SEQ ID NO: 12) FR4 GQGTQVTVSS 113-122 (SEQ ID NO: 13)

It has been found that mainly CDR3 is responsible for the strong bindingof the epitope peptide provided herein. Another important part of theamino acid sequence of BC2-Nb is the cysteine (at position 50 of SEQ IDNO:6) present in FR2. This cysteine forms a disulfide bridge with thecysteine of CDR3 and is thus responsible for the binding competentfolding of the CDR3. Therefore, any nanobody comprising as CDR3 the CDR3sequence of BC2-Nb, and comprising a cysteine in the framework region 2that is capable of forming a disulfide bridge with the cysteine of theCDR3, is suitable as one partner of the epitope tag nanobodyinteraction. In a nonlimiting example, there are about 45 to 55, forexample about 50 or 51 amino acid residues between the cysteine in theFR2 and the cysteine in CDR3. In the case of the BC2-Nb the cysteine ofFR2 is at position 50, and the cysteine in CD3 is in position 102 (theposition markers are referring to the position in SEQ ID NO:6 as shownabove). Therefore there are 51 amino acid residues located between thecysteine in framework region 2 and the cysteine in CDR3.

Thus, the antibody BC2-Nb or a functional variant thereof comprising atleast CDR3 of BC2-Nb and a cysteine in FR2 is an epitope specificantibody. The nanobody BC2-Nb or a functional variant thereof comprisingat least CDR3 of BC2-Nb and a cysteine in FR2 is also an interactionpartner with the epitope tag provided herein, in all of the disclosedmethods of purification and/or detection as well as other uses disclosedin this application, as it has been shown that it is a very reliable andefficient binding partner of the epitope tagged polypeptides of interestBC2-Nb and its functional variants can also be a components) of the kitsdisclosed in this application Provided herein is also the use of theBC2-Nb or a functional variant thereof for detecting, quantifying,determining the subcellular localization of, or purifying a polypeptidecomprising an epitope tag provided herein.

A functional variant of BC2-Nb is any antibody or fragment that has anaffinity for the epitope tag provided herein that is at least 80%, morepreferably at least 90% or at least 95% or even 99% or more than BC2-Nb.The affinity of a variant and of BC2-Nb can be measured as is known inthe art and the results can be compared as is known to the skilledartisan and by well-known assays, for example by surface plasmonresonance (SPR), or by other protein-protein interaction monitoringassays.

Further provided is a purification method, or a capture and purificationmethod. The purification method can be used in analytic,semi-preparative and preparative methods.

Thus, also provided is a method of purifying a polypeptide, such as aprotein of interest comprising at least one epitope tag provided herein.

In this aspect the method comprises a capture step of contacting asample, for example a solution, comprising a tagged polypeptide, with anaffinity ligand, such as an antibody, capable of specifically binding tothe epitope tag. The affinity ligand specifically binding to the epitopepeptide can be an antibody, for example, the nanobody BC2-Nb or avariant thereof, as defined above.

The sample to be contacted with the epitope specific antibody can be anytype of sample comprising a tagged polypeptide and can be processed toseparate the polypeptide Preferably the sample is a solution, forexample a lysate of a host cell, comprising the tagged polypeptide, or asupernatant, such as a supernatant obtainable by centrifugation of aliquid comprising a host cell comprising the construct or laggedpolypeptide, wherein the host cell is capable of secreting or otherwisetransporting the tagged polypeptide into the liquid.

An antibody, a fragment or a variant used in a method for purifying apolypeptide, for example a protein of interest, can be used in solutionor immobilized. To immobilize an affinity ligand such as an antibodyagainst the epitope tag or a fragment or variant thereof, the affinityligand, such as the antibody, fragment or variant, or a mixture thereofcan be bound to a sample carrier, solid support, or matrix. Thisimmobilization step can occur prior to or after the binding of theaffinity ligand, such as an antibody, to the epitope tag. Methods forimmobilizing affinity ligands, such as antibodies, and parts thereof arewell-known to the skilled artisan and any method that allowsimmobilization without impairing binding properties can be used.

If the affinity ligand, for example, an antibody, capable ofspecifically binding to the epitope peptide provided herein is notimmobilized to a solid support, then the method optionally can comprisea further step for isolating the complex, for example by using a bindingpartner for the complex, such as a secondary antibody that is specificfor example for the complex or (in case the epitope specific antibody isan IgG antibody) for the constant part of the antibody. The secondarybinding partner can be in solution or can be immobilized orimmobilizable to a solid support.

The term “solid support” or “matrix” refers to any type of carriermaterial that can be used for immobilization of affinity ligands, forexample, antibodies or parts thereof and it can refer to material inparticulate (e.g. beads or granules, generally used in extractioncolumns) or in sheet form (e.g. membranes or filters, glass or plasticslides, microtitre assay plates, dipstick, capillary fill devices orsuch like) which can be flat, pleated, or hollow fibers or tubes.Suitable and well-known matrices without being exhaustive; are silica(porous amorphous silica), agarose or polyacrylamide supports, ormacroporous polymers. Examples include dextran, collagen, polystyrene,polypropylene, polyvinylchloride, polyacrylamide, methacrylate,celluloses, calcium alginate, controlled pore glass, aluminum, titaniumand porous ceramics, synthetic polymers and co-polymers, latex, silica,agarose, metal, glass, and carbon. Alternatively, the solid surface maycomprise part of a mass dependent sensor, for example, a surface plasmonresonance detector. Conveniently, an array comprising a plurality ofindividual affinity ligands, for example, antibodies or antibodyfragments, which are capable of specifically binding the epitope tagbound or immobilized to a solid surface is provided. This array can beused to capture tagged polypeptides comprised in a solution as soon thesolution is brought in contact with the immobilized affinity ligands,such antibodies or antibody fragments.

In a further step following the capture step, the solid supportcomprising the immobilized epitope specific antibody bound to the laggedpolypeptide optionally is washed to remove unbound and unspecificallybound constituents.

In a further step, the tagged polypeptide can be optionally eluted toobtain the isolated polypeptide, such as the protein of interest.Elution of lagged protein hound to the immobilized antibody can beachieved by methods known in the art. For example, the tagged proteincan be eluted by competitive elution with the isolated epitope peptide.This isolated epitope peptide will then be in competition with thetagged polypeptide to bind the immobilized epitope specific antibody, ifthe isolated polypeptide is added in surplus concentration, the reactionbalance of the binding will be shifted to the binding of the immobilizedantibody with the isolated epitope peptide. This results in the releaseof the tagged polypeptide. The released polypeptide can then optionallybe purified further by method steps known to the skilled person.

The tagged polypeptide can also remain immobilized to the solid support,such as beads, and processed further in downstream application such asmass spectrometry, without the elution step.

In another example, a tagged protein, for example a tagged polypeptidethat comprises a linker with a cleavage site, can be cleaved with anappropriate means, for example a prolease to remove the tag, thereby thepolypeptide is released from the immobilized antibody, and thepolypeptide can be obtained in its native form. For this example, thenucleic acid sequence encoding the polypeptide should not only comprisea sequence encoding the epitope tag but also a sequence encoding alinker with a breakable site, for example a cleavage site recognized bya protease. The release step by enzymatic cleave can replace or followthe elution step.

Also provided is a kit comprising the components necessary to carry outthe methods provided herein.

For example, a kit for capture and/or detection of tagged polypeptidesis provided. The kit is suitable for carrying out a method of captureand/or detection. The kit comprises the following comprises:

-   -   a nucleic acid or a nucleic acid expression construct encoding        an epitope peptide provided herein;    -   an affinity ligand, for example, and not to be limiting, an        antibody capable of specifically binding to the epitope peptide    -   optionally a detectable moiety; and    -   buffers and reagents necessary for the capture and/or detection        methods described herein.

The nucleic acid or the nucleic acid expression construct encoding theepitope peptide is to be introduced into a host cell as described above,and as is known to the skilled person. The kit can in this regard alsocomprise the buffers and reagents necessary to introduce the nucleicacid or the nucleic acid expression construct encoding the epitopepeptide into a host cell.

The affinity ligand capable of specifically binding to an epitopepeptide provided herein can be an antibody, for example, the nanobodyBC2-Nb or a variant thereof as defined above.

The affinity ligand capable of specifically binding to the epitopepeptide, for example, an antibody, can comprise at least one detectablemoiety. The kit can also comprise at least one secondary bindingpartner, that can bind to a unit provided on the complex or one of itscomponents, such as avidin or streptavidin if the complex or one of itscomponents is biotinylated, or secondary and thud antibodies forcapturing the complex built from the tagged polypeptide and the affinityligand, such as an antibody. These further antibodies are, for example,specific for other pans of the polypeptide or the epitope specificaffinity ligand, such as an antibody, or for units provided on thepolypeptide, the affinity ligand, such as an antibody, or the complex,such as the constant part of the primary affinity ligand or antibodycapable of specifically binding to the epitope peptide. The furtherantibodies can also comprise a detectable moiety.

Affinity ligands, for example, antibodies, as defined herein areparticularly useful for detecting, capturing, and/or purifying apolypeptide comprising an epitope peptide provided herein. Therefore,provided herein is the use of an epitope specific affinity ligand, suchas an antibody, as defined herein, in particular a nanobody fordetecting, capturing, and/or purifying a polypeptide comprising theepitope peptide described herein.

The kit can further comprise a solid support comprising the affinityligand, such as an antibody, specifically binding to the epitope peptideimmobilized or attached to the solid support.

The delectable moiety can be any detectable moiety as defined above.Preferably, the detectable moiety is a fluorescent label.

In another example, a kit for capture and purification of taggedpolypeptides is provided.

The kit for purification of tagged polypeptides comprises the followingcomponents.

-   -   a nucleic acid or a nucleic acid expression construct encoding        an epitope peptide provided herein;    -   an affinity ligand, for example, an antibody capable of        specifically binding to the epitope peptide;    -   optionally a solid support; and    -   buffers and reagents necessary for the capture and purification        methods described herein.

The nucleic acid or the nucleic acid expression construct encoding theepitope peptide is introduced into a host cell as described above, andas is known to the skilled person. The kit can also comprise the buffersand reagents necessary to introduce the nucleic acid or the nucleic acidexpression construct encoding the epitope peptide into a host cell.

The affinity ligand, for example, an antibody capable of specificallybinding to the epitope peptide, can be the nanobody BC2-Nb or a variantthereof as defined above.

The affinity ligand, for example, an antibody capable of specificallybinding to the epitope peptide, can be in solution or immobilized orattached to a solid support. The kit can also comprise further bindingpartners, as outlined above. The further binding partner can alsocomprise a detectable moiety.

The solid support can be any solid support as defined above, and asuitable way of immobilization or attachment of the affinity ligand,such as an antibody, to the solid support can be chosen by the skilledperson.

The kit can also comprise reagents suitable to release the capturedtagged polypeptide from the affinity ligand, such as an antibody,capable of specifically binding to the epitope peptide provided herein.Thus, the kit can, for example, comprise an isolated epitope peptideprovided herein. It can also comprise an enzyme, which is capable ofreleasing the captured lagged polypeptide from the affinity ligand, suchas an antibody capable of specifically binding to the epitope peptide bycleaving the tag from the poly peptide. For example, the enzyme can be aprotease such as TEV protease, thrombin, factor Xa protease, or aPreScission protease.

Furthermore provided is also a complex of a construct comprising anepitope peptide and a protein of interest as defined above with anepitope specific antibody. Also provided is such a complex immobilisedon a carrier or support like a bead or a column.

Disclosed are materials, compositions, and components that can be usedfor, can be used in conjunction with, can be used in preparation for, orare products of the disclosed methods and compositions. These and othermaterials are disclosed herein, and it is understood that whencombinations, subsets, interactions, groups, etc. of these materials aredisclosed that while specific reference of each various individual andcollective combinations and permutations of these compounds may not beexplicitly disclosed, each is specifically contemplated and describedherein. For example, if a method is disclosed and discussed and a numberof modifications that can be made to a number of molecules including themethod are discussed, each and every combination and permutation of themethod, and the modifications that are possible are specificallycontemplated unless specifically indicated to the contrary. Likewise,any subset or combination of these is also specifically contemplated anddisclosed. This concept applies to all aspects of this disclosureincluding, but not limited to, steps in methods using the disclosedcompositions. Thus, if there are a variety of additional steps that canbe performed, it is understood that each of these additional steps canbe performed with any specific method steps or combination of methodsteps of the disclosed methods, and that each such combination or subsetof combinations is specifically contemplated and should be considereddisclosed.

Publications cited herein aid the material for which they are cited arehereby specifically incorporated by reference in their entireties.

EXAMPLES

Provided below are the following examples which should not beinterpreted as restricting the scope or spirit of the invention.

Example 1 Overview

By covalently coupling the monovalent BC2-Nb to solid matrices a BC2nanotrap was generated. This serves as a highly efficient pulldownreagent similar to the previously described GFP nanotrap [4] whichbecomes widely used to purify GFP-tagged complexes from cellular lysates[5]. The excellent binding characteristics of the BC2 nanotrap arefavorable for proteomic analysis applying e.g. highly competitivebinding conditions using chaotropic agents (up to 4 M Urea or 1.5 MGdmCl) or denaturing detergents (2% SDS) in the binding reaction [6]. Incontrast to the GFP nanotrap where bound proteins are released onlyunder denaturing conditions, BC2 nanotrap-bound proteins can be elutedin a functional conformation simply by peptide competition using lowamounts of BC2-peptide (0.1-1 mM). Positional cloning revealed that theN- or the C-terminally localized BC2-tag is equally well recognized bythe BC2 nanotrap. For N-terminally lagged GFP an additional band in thenon-bound fraction was observed. This indicates that the BC2-tag mightbe proteolytically removed upon cellular lysis, a phenomenon which hasalready been described for other epitope tags in mammalian expressionsystems [7].

The BC2-based capture and detection system described in this study is anattractive alternative to currently available epitope-tag systems.BC2-Nb binds BC2-epitope-tagged proteins with a K_(D) of ˜1.4 nM andtherefore shows a ˜10-100 fold higher affinity compared to the availableFLAG, HA, c-myc or the nanobody-derived EPEA-systems [8-10].

Recently, nanobodies against β-catenin were generated recognizingdifferent epitopes within the amino acid sequence of β-catenin. Theseantibodies were used to monitor endogenous β-catenin within cells [11].One of these Nbs, referred to as BC2-Nb, recognizes a short linearepitope corresponding to aa 16-27 of β-catenin.

Here, a detailed structural and biochemical analysis of the BC2-Nb andits interactions with the peptide epitope, hereafter referred to asBC2-tag (BC2T), is provided. Data reveal an unusual binding mechanismmediated by the extended CDR3 and the framework of BC2-Nb, which istermed “headlock binding” Based on the high-affinity binding propertiesof BC2-Nb a novel BC2-tag purification and detection system wasdeveloped and characterized. It is shown dial the immobilized BC2-Nbefficiently captures and purifies BC2-tagged proteins from bacterial andmammalian cell extracts in a natively folded state By co-localizationanalysis, for the first time the detection of peptide-tagged cellularproteins using a fluorescently labeled nanobody could be shown. Thisversatile capture and detection system now enables a unique combinationof biochemical and microscopic analyses of a large variety of proteinscomprising this small, inert peptide-tag.

A nanobody was generated that binds a short linear peptide with veryhigh affinity (K_(D)˜1.4 nM). Structure analysis of theBC2-Nb/BC2-peptide complex revealed an unusual binding mode in which theBC2 peptide forms an additional antiparallel p-strand that inserts intoa p-sheet structure formed by the CDR3 and the framework regions.Numerous backbone hydrogen bonds provide affinity, and a salt bridgebetween Arg106 of CDR3 and Glu44 of FR2 embraces the bound peptide in aheadlock-like fashion. A comparison with all nanobody complexesavailable in the Protein Data Bank (PDB) shows that such a binding modehas not been previously observed in any of the 81 known crystallizedcomplexes. Affinity measurements of headlock-mutated BC2-Nbs show˜10-fold reduced binding affinities and higher off-rates. These dataclearly suggest that the main function of the headlock is in fasteningthe already-bound peptide to the nanobody.

Epitope analysts using positional scanning peptide libraries revealed asmall set of four specificity-determining residues, with the mostcritical being W10. By contrast the other residues do not contributetowards the binding specificity, mostly because they are not engaged incontacts. The reliance on many backbone interactions to generateaffinity and only four amino acid side chains for specificity rendersthe BC2T/BC2-Nb system especially versatile. Most of the BC2T residuescan be replaced without sacrificing binding efficiency, and this allowsa straightforward adaptation of the tag for individual applications,including charge modifications, addition of tryptophan residues forincreased absorbance at 280 nm, or introduction of a protease cleavagesite or even an unusual amino acid for labeling purposes.

Finally, a study shows that the BC2-Nb offers new opportunities forcellular imaging. Basically, nanobodies can be easily modified bysite-directed labelling with the full-range of available organic dyesand applied for direct imaging of cellular structures without the needof a secondary antibody for detection. Recently, it was demonstratedthat GFP and RFP-specific nanobodies labeled with photoactivatedlocalization microscopy (PALM)-compatible organic dyes such asAlexaFluor 647 (AF647) efficiently visualize fluorescent fusion proteinsin high-resolution microscopy. There is still an ongoing demand fornanobodies that recognize a smaller epitope to minimize sterichindrances or potential linkage errors derived from large tags such asGFP or RFP. In this study it was demonstrated for the first time that afluorescently labeled peptide-specific nanobody can be used for thevisualization of distinct subcellular structures such as theintermediate filamentous network or an essential part of the replicationmachinery. Based on these findings it was proposed by the inventors thata BC2-mediated detection system provides unique advantages compared tocurrently available approaches. Firstly, due to its small size (12 aa,1.4 kDa) the BC2-tag does not interfere with the subcellularlocalization or folding of ectopically expressed proteins. Secondly, theusage of a small nanobody (2×4 nm) directly links the detection signalto the corresponding antigen. Thirdly, the BC2 nanobody can be easilymodified e.g. for site directed labeling with appropriate highresolution compatible dyes. In summary, nanobody-mediated labeling ofBC2-tagged constructs now combine a minimal epitope tag with the highphoton yield of organic dyes and minimal linkage error.

Example 2 Structural Basis of Epitope Binding

Since most of the described Nbs have conformational epitopes, themolecular mechanism underlying the observed high-affinity binding wasinvestigated by solving high-resolution crystal structures of BC2-Nbalone (at a resolution of 1.8 Å) and in complex with BC2T (1.0 Å). Fromhere on, BC2T amino acids will be referred to in one-letter code andBC2-Nb amino acids in three-letter code in order to facilitate thepresentation and discussion of the results. The amino acid positions arereferring to the positions of amino acid residues in the epitope tagconsisting of 12 amino acids, as defined in SEQ ID NO:4 (PDRKAAVSHWQQ).

The BC2-Nb adopts the typical immunoglobulin fold, which consists ofnine β-strands forming two (5-sheets connected by loops and by aconserved disulfide bond between Cys22 and Cys96 (FIG. 5 a). Similar toother nanobody structures. BC2-Nb features an especially long CDR3,which contains 14 amino acids and is stabilized by an additionaldisulfide bond formed between Cys102 (located in the CDR3) and Cys50(located in the framework region 2 (FR2)) (FIG. 5 a). The BC2T binds toBC2-Nb in an elongated conformation, inserting into a groove between theCDR3 and the FR2 and FR3 of BC2-Nb. The peptide is integrated into theBC2-Nb structure, forming a strand in a β-sheet (FIG. 1 a). This resultsin a large number of backbone hydrogen bonds that anchor the peptide toneighboring secondary structure elements. Notably, neither CDR1 nor CDR2are involved in the interaction with BC2T (FIG. 1 a, Table 2 below)

TABLE 2 BC2-Nb BC2-Nb-BC2T complex* Data collection Space group P2₁2₁2₁(19) C2 (5) Cell dimensions a, b, c (Å) 31.77, 47.74, 67.97 106.01,31.53, 35.88 α, β, γ (°) 90, 90, 90 90, 107.54, 90 Resolution (Å)39.07-1.80 (1.85-1.80) 50.55-1.00 (1.03-1.00) R_(meas) (%) 8.8 (94.6)6.4 (99.7) I/σI 13.06 (1.45) 18.42 (2.05) CC_(1/2) (%) 99.8 (52.0) 100(52.4) Completeness (%) 99.6 (99.6) 99.6 (98.1) Redundancy 14.4 (3.6)9.5 (8.0) Wilson B (Å²) 27.7 11.5 Refinement Resolution (Å) 39.07-1.8050.55-1.00 No. reflections 70062 580345 R_(work)/R_(free) (%) 19.1/21.712.8/14.9 No. atoms BC2-Nb 902 1037 BC2T — 115 Water 36 134 MPD 8 —B-factor (Å²)) BC2-Nb 27.8 11.2 BC2T — 14.3 Water 32.6 24.3 MPD 38.6 —R.m.s. deviations Bond length (Å) 0.011 0.020 Bond angles (°) 1.1071.452

The unbound and bound BC2-Nb structures are similar, and can besuperimposed with a small overall root mean square deviation (RMSD) ofthe atomic positions 0.4 Å (Phenix structure comparison). The CDR3 loopsin particular have similar orientations, with the exception of two aminoacids (Arg106 and Tyr107) that are flipped almost 180 degrees (FIG. 5b). In the unliganded BC2-Nb, the β-carbon of Arg106 is oriented towardsthe core structure of the Nb, while the Arg106 side chain interacts withthe backbone carbonyl group of Glu108 and with the s-electron system ofTyr109. The β-carbon of Tyr107 is pointing away from the Nb and thearomatic ring is involved in a cation-π interaction with Arg45. In thecomplex, residues Arg106 and Tvr107, are flipped, as their β-carbons andtheir side chains face into opposite directions Arg106 is now involvedin a charge-mediated interaction with the side chain of Glu44 located inFR2 (FIG. 1 b) This unusual interaction, which is termed “headlock” bythe inventors, reaches over the bound peptide and locks it firmly inplace. Several direct and water-mediated interactions fasten the peptidein its binding site (FIG. 1 c). However, only a small subset of BC2Tresidues is involved in these contacts: R3 forms a salt bridge withAsp110, S8 is engaged in a hydrogen bond with the carbonyl group ofLys103 and also complexes a water molecule together with Tyr109, and W10is buried in a hydrophobic pocket where it forms both a CH-π interactionwith Cys50 and a hydrogen bond with the carbonyl group of Cys102 (acomplete overview of all interactions is shown FIG. 6).

The impact of point mutations on the binding properties of the BC2-Nbwas investigated. First, the additional disulfide bond that connectsCDR3 and FR2 was removed by replacing Cys50 with an alanine and Cys102with a serine (BC2-Nb_(C50A_C102S)). Binding studies using modified GFPthat contains BC2T as a C-terminal peptide tag (GFP_(BC2T)) revealedthat this mutation leads to complete loss of binding (data not shown).The most likely explanation for this result is that the disulfide bridgeis required to maintain the structure of the CDR3, which contributesmost of the contacts with BC2T. Next, the contribution of the “headlock”motif towards binding was analyzed Arg106 was replaced with either aserine (BC2-Nb_(R106S)) or a glutamate (BC2-Nb_(R106E)) and surfaceplasmon resonance (SPR) measurements were performed using the GFP_(BC2T)construct. Both mutant proteins yield K_(D) values of ˜11 nM which areabout 10-fold lower compared to BC2-Nb (K_(D): 1.4 nM) (Table 3 below,FIG. 7). Interestingly, the wt and mutant proteins showed similaron-rates, whereas significantly higher off-rates were observed in bothmutants (Table 3). In summary, the modest changes in the affinities areconsistent with the structural analysis. The thirteen backboneinteractions of the extended peptide clearly contribute to the highaffinity of the interaction. The “headlock” most likely helps to securethe bound peptide, as indicated by the lower dissociation rate of the wtprotein.

TABLE 3 K_(D) Nbs R_(max) [RU] [nM] k_(on) [M⁻¹s⁻¹] k_(off) [s⁻¹] χ²BC2-Nb 170 ± 0.34 1.4 ± 0.06 4.6 ± 0.04 × 10⁵ 6.4 ± 0.27 × 10⁻⁴ 3.5 BC2-160 ± 0.48 12.0 ± 0.15  2.8 ± 0.01 × 10⁵ 3.4 ± 0.04 × 10⁻³ 3.8Nb_(R106E) BC2-Nb_(R106S) 160 ± 0.43 9.7 ± 0.13 3.3 ± 0.03 × 10⁵ 3.2 ±0.03 × 10⁻³ 3.1

Example 3 Detailed Epitope Analysis Using Synthetic Positional ScanningPeptide Libraries

Although the high-affinity binding and kinetics can be explained by theobserved structural features, the specificity of complex formation mustlie elsewhere. As only a subset of BC2T residues are involved inspecific, side-chain-mediated contacts with the Nb, the contributions ofthese individual amino acids for binding using positional scanningpeptide libraries were examined. In total, 12 different BC2T librariesdisplaying all 20 proteinogenic amino acids in one single position ofthe peptide were used in immunoprecipitation experiments. The BC2Tlibraries were subjected to liquid chromatography followed by massspectrometry analysis before and after immunoprecipitation withimmobilized BC2-Nb, BC2-Nb_(R106S) or BC2-Nb_(R106E). By determining thepeptides remaining in the non-bound fraction after pulldown, thecomposition of non-bound peptides and correspondingly, the invariableamino acid positions (Table 4, below) were mapped. Specificallyprecipitated peptides were identified by a comparative analysis of thepeptides in the supernatant of BC2-Nb or mutants thereof and thesupernatant of a non-BC2T-related control Nb (GFP-specific Nb). Thedirect comparison with the GFP-Nb made it possible to determine thequantitative degree of peptide capture without the need of labeledstandards. Thus, if no peptide was captured by the BC2-Nb, the ratiogiven in FIG. 2 would be one or close to one. The more peptide wascaptured the more the value approximates 0. It was not possible todiscriminate between isoleucine (I) and leucine (L) peptide versionssince these peptides are isobaric. However, two peptide versionscontaining I/L could be resolved and thus two values are given asresult.

The analysis show s that BC2T residues at positions 1, 2, 4, 5, 7, 9,11, and 12 of SEQ ID NO:4 can be replaced without affecting the epitopebinding properties (Table 4, below). These results are in agreement withthe structure, which shows that the side chains of all eight residuesface away from the Nb. However, the analysis of the supernatants of theR3, A6, S8 and W10 BC2T libraries revealed little cross-reactivities ofthe nanobodies to other amino acids at these positions (FIG. 2 a-d).Notably, bound peptides from the R3 BC2T library (BC2T_(R3X)) haveeither residues with basic properties, i.e. K3 or H3 or residuescomprising smaller side chains, i.e. T3 or V3 (FIG. 2 a). Thisobservation is in agreement with the crystal structure as the saltbridge between R3 and Asp110 could possibly also be formed by H3 or K3.The analysis of the BC2T_(A6X) library revealed preferences of theBC2-Nb to small amino acids at position 6, as cysteine, valine,threonine and serine permutants were efficiently pulled down from thelibrary. Peptides containing glycine, leucine or isoleucine at thisposition were pulled-down in smaller amounts (FIG. 2 b). These resultsare consistent with the crystal structure, where the A6 side chain ispointing towards a small hydrophobic pocket of the BC2-Nb. Binding oflarger amino acids (i.e. leucine, isoleucine) would requireconformational rearrangements, and a glycine would introduce additionalflexibility A similar effect was observed for the BC2T_(S8X)-library.Alanine, cysteine, threonine and valine can replace S8, while otherresidues were captured to a minor degree (FIG. 2 c). The hydroxyl groupof S8 forms a hydrogen bond with the carbonyl group of Lys103 of thenanobody. Cysteine and threonine could engage in somewhat similar,productive interactions, while the valine could at least beaccommodated. The tryptophan at position 10 appears to be an invariablespecificity-determining amino acid in our peptide scanning study, asnone of the W10 permutants were captured by BC2-Nb (FIG. 2 d). This isin excellent agreement with the structural analysis, which shows thatthe W10 side chain inserts into a deep, hydrophobic pocket on thenanobody.

The results of the epitope scanning for the mutants BC2-Nb_(R106S) andBC2-Nb_(R106E) are almost identical with the BC2-Nb, with one notableexception. The analysis of BC2T_(A6X) library revealed that in contrastto BC2-Nb the mutated Nbs precipitated to a small extent also BC2Tvariants that carried a lysine or arginine at position 6 (FIG. 2 b).This observation suggests that mutation of the headlock-forming Arg45 bySer or Glu results in somewhat lower peptide specificity. In summary,four positions in BC2T with limited or no amino acid variability wereidentified. The results are in accordance with the structure, whichshows that the side chains of R3, S8 and W10 are directly involved inBC2-Nb interactions while permutants of A6 are probably stericallydisfavored and may prevent the formation of the headlock binding. Takentogether, this analysis confirms that a small subset of side-chaininteractions help determine the specificity of BC2-Nb for the BC2Tsequence.

TABLE 4 Table 4-1: BC2-Nb (r: ratio; e: P D R K A A ratio error ratioerror ratio error ratio error ratio error ratio error A 0.1 0.0 0.2 0.00.7 0.0 0.2 0.0 0.2 0.0 0.1 0.0 G 0.4 0.0 0.3 0.0 0.9 0.1 0.6 0.0 0.30.1 0.7 0.1 V 0.2 0.0 0.1 0.0 0.3 0.0 0.1 0.0 0.1 0.0 0.3 0.0 IL1 0.40.1 0.1 0.0 0.5 0.0 0.1 0.0 0.1 0.0 0.7 0.0 IL2 0.3 0.1 0.2 0.0 0.5 0.00.2 0.0 0.1 0.0 0.0 0.0 M 0.2 0.0 0.1 0.0 0.7 0.0 0.1 0.0 0.1 0.0 1.00.1 P 0.1 0.0 0.3 0.1 0.5 0.1 1.0 0.1 0.1 0.0 1.1 0.1 F 0.2 0.0 0.2 0.00.5 0.1 0.3 0.0 0.1 0.0 1.0 0.1 W 0.1 0.0 0.2 0.0 0.8 0.1 0.3 0.0 0.00.0 1.1 0.1 S 0.2 0.0 0.1 0.0 0.6 0.0 0.3 0.0 0.1 0.0 0.5 0.0 T 0.5 0.10.1 0.0 0.3 0.0 0.2 0.0 0.1 0.0 0.2 0.0 N 0.6 0.1 0.2 0.0 0.7 0.0 0.50.0 0.3 0.1 1.0 0.0 Q 0.5 0.0 0.1 0.0 0.5 0.0 0.1 0.0 0.2 0.0 1.0 0.0 Y0.1 0.0 0.1 0.0 0.7 0.1 0.2 0.0 0.1 0.0 1.0 0.0 C(Ab₂) 0.1 0.0 0.1 0.00.5 0.0 0.1 0.0 0.1 0.0 0.1 0.0 K 0.3 0.0 0.2 0.0 0.2 0.0 0.1 0.0 0.20.1 1.0 0.1 R 0.2 0.0 0.1 0.0 0.1 0.0 0.1 0.0 0.2 0.2 1.1 0.1 H 0.3 0.10.1 0.0 0.3 0.0 0.1 0.0 0.2 0.0 1.1 0.1 D 1.0 0.1 0.1 0.0 1.0 0.1 1.00.0 0.5 0.1 1.0 0.1 E 0.3 0.1 0.2 0.0 1.0 0.1 0.5 0.0 0.3 0.0 1.1 0.1 VS H W Q Q ratio error ratio error ratio error ratio error ratio errorratio error A 0.4 0.0 0.0 0.0 0.2 0.0 1.0 0.1 0.1 0.1 0.1 0.0 G 0.7 0.10.5 0.2 0.4 0.6 1.2 0.2 0.1 0.1 0.2 0.0 V 0.1 0.0 0.1 0.0 0.2 0.0 1.00.1 0.1 0.0 0.1 0.0 IL1 0.1 0.0 0.4 0.1 0.3 0.0 1.0 0.1 0.1 0.0 0.1 0.0IL2 0.2 0.0 0.7 0.2 0.1 0.0 1.0 0.0 0.1 0.0 0.1 0.0 M 0.1 0.0 0.8 0.20.1 0.6 1.0 0.1 0.1 0.0 0.1 0.0 P 1.0 0.1 0.7 0.2 0.3 0.1 1.1 0.1 0.50.4 0.1 0.0 F 0.2 0.0 0.7 0.2 0.1 0.5 0.0 0.0 0.0 0.0 0.2 0.0 W 0.1 0.00.5 0.3 0.1 1.0 0.1 0.0 0.0 0.0 0.1 0.0 S 0.3 0.0 0.1 0.0 0.2 0.5 1.10.2 0.1 0.0 0.1 0.0 T 0.2 0.0 0.1 0.0 0.2 1.0 1.1 0.1 0.1 0.0 0.1 0.0 N0.2 0.0 0.2 0.1 0.1 0.5 1.0 0.1 0.1 0.1 0.1 0.0 Q 0.4 0.0 0.7 0.2 0.20.0 1.1 0.1 0.1 0.1 0.1 0.0 Y 0.1 0.0 0.5 0.2 0.1 0.0 1.0 0.1 0.0 0.00.1 0.0 C(Ab₂) 0.2 0.0 0.0 0.0 0.2 0.0 1.0 0.0 0.1 0.1 0.1 0.0 K 0.3 0.00.3 0.2 0.2 0.0 1.0 0.1 0.0 0.0 0.0 0.0 R 0.2 0.0 0.5 0.3 0.1 0.0 1.00.1 0.0 0.0 0.1 0.0 H 0.5 0.0 0.7 0.2 0.1 0.0 1.0 0.1 0.1 0.0 0.1 0.0 D0.5 0.0 0.3 0.2 0.1 0.0 1.0 0.1 0.2 0.1 0.6 0.1 E 0.4 0.0 0.7 0.2 0.20.0 1.1 0.2 0.1 0.1 0.4 0.1 Table 4-2: BC2-Nb_((R106S)) (r: ratio; e:error) P D R K A A Ratio error Ratio error Ratio error Ratio error Ratioerror Ratio error A 0.3 0.0 0.4 0.1 0.9 0.1 0.5 0.0 0.4 0.0 0.2 0.0 G0.6 0.1 0.5 0.1 1.0 0.1 0.9 0.1 0.5 0.0 1.0 0.1 V 0.4 0.0 0.2 0.0 0.70.1 0.2 1.0 0.2 0.0 0.5 0.0 IL1 0.7 0.1 0.2 0.0 0.9 0.1 0.4 0.1 0.2 0.00.8 0.1 IL2 0.5 0.4 0.2 0.0 0.8 0.1 0.6 0.1 0.2 0.3 0.8 0.1 M 0.5 0.10.2 0.0 0.9 0.1 0.5 0.3 0.2 0.0 0.8 0.1 P 0.2 0.0 0.3 0.1 1.0 0.1 1.00.1 0.4 0.5 1.0 0.1 F 0.3 0.1 0.3 0.1 1.0 0.1 0.1 0.1 0.2 0.3 0.8 0.1 W0.3 0.0 0.3 0.1 1.0 0.1 0.6 0.1 0.1 0.9 1.0 0.1 S 0.6 0.1 0.1 0.1 0.00.1 0.7 0.0 0.4 0.3 0.6 0.0 T 0.7 0.1 0.2 0.1 0.5 0.1 0.7 0.0 0.3 0.30.4 0.1 N 0.7 0.1 0.3 0.1 1.0 0.0 0.8 0.1 0.5 0.0 1.0 0.1 Q 0.7 0.1 0.20.0 0.9 0.1 0.4 0.0 0.2 0.5 0.9 0.1 Y 0.3 0.0 0.2 0.0 1.0 0.1 0.5 0.00.2 0.0 0.9 0.1 C′ 0.3 0.0 0.1 0.1 0.9 0.2 0.3 0.0 0.3 0.0 0.3 0.0 K 0.50.1 0.2 0.1 0.5 0. 0.2 0.0 0.2 0.0 0.5 0.1 R 0.4 0.1 0.2 0.0 0.2 0.1 0.10.0 0.1 0.0 0.7 0.1 H 0.5 0.1 0.2 0.0 0.7 0.1 0.3 0.3 0.3 0.0 0.9 0.1 D0.6 0.1 0.1 0.1 1.0 0.0 1.0 0.0 0.3 0.1 1.0 0.1 E 0.8 0.1 0.4 0.1 1.00.0 0.8 0.0 0.7 0.0 1.0 0.1 V S H W Q Q Ratio error Ratio error Ratioerror Ratio error Ratio error Ratio error A 0.7 0.0 0.1 0.0 0.5 0.0 1.00.1 0.4 0.1 0.3 0.1 G 0.8 0.0 0.2 0.1 0.5 0.2 1.1 0.2 0.5 0.2 0.4 0.1 V0.3 0.0 0.3 0.0 0.8 0.3 1.0 0.1 0.2 0.0 0.2 0.1 IL1 0.3 0.5 0.6 0.3 0.51.0 1.1 0.1 0.2 0.0 0.2 0.1 IL2 0.4 0.0 0.9 0.1 0.3 1.0 1.1 0.1 0.4 0.50.2 0.1 M 0.5 0.1 1.0 0.1 0.3 1.0 1.1 0.1 0.3 0.1 0.3 0.1 P 1.0 0.1 1.00.1 1.0 1.0 1.1 0.1 0.9 0.0 0.2 0.1 F 0.4 0.0 0.9 0.1 0.2 1.0 1.1 0.10.2 0.5 0.4 0.1 W 0.3 0.0 1.0 0.1 0.1 1.0 0.1 0.9 0.1 0.2 0.5 0.1 S 0.40.0 0.2 0.0 0.5 1.0 1.1 0.2 0.2 1.0 0.2 0.1 T 0.3 0.0 0.1 0.2 0.5 0.61.9 0.0 0.2 0.0 0.3 0.1 N 0.4 0.0 0.7 0.1 0.3 1.0 1.1 0.1 0.3 0.0 0.20.1 Q 0.5 0.0 0.8 0.1 0.4 0.6 1.1 0.2 0.3 0.5 0.2 0.1 Y 0.3 0.0 0.9 0.00.2 1.0 1.0 0.6 0.1 0.0 0.3 0.1 C′ 0.5 0.1 0.1 0.0 0.5 1.0 1.0 0.1 0.30.0 0.2 0.1 K 0.3 0.0 0.9 0.1 0.3 0.6 1.1 0.1 0.1 0.0 0.1 0.0 R 0.2 0.00.9 0.1 0.2 0.0 1.1 0.2 0.1 0.0 0.1 0.0 H 0.4 0.0 0.8 0.1 0.3 0.9 1.10.1 0.2 0.0 0.3 0.1 D 0.9 0.1 0.9 0.1 0.5 0.6 1.0 0.1 0.8 0.0 0.3 0.1 E0.9 0.0 1.0 0.1 0.8 0.1 1.0 0.6 0.5 0.0 0.7 0.1 Table 4-3:BC2-Nb_(R105E)) (r: ratio; e: error) P D R K A A Ratio error Ratio errorRatio error Ratio error Ratio error Ratio error A 0.2 0.0 0.2 0.1 1.00.1 0.4 0.1 0.2 0.1 0.1 0.0 G 0.6 0.0 0.4 0.1 1.0 0.1 0.9 0.1 0.4 0.10.8 0.1 V 0.3 0.0 0.1 0.0 0.5 0.0 0.1 0.0 0.1 0.0 0.3 0.0 IL1 0.6 0.10.1 0.0 0.9 0.2 0.3 0.1 0.1 0.0 0.7 0.1 IL2 0.5 0.2 0.2 0.1 0.9 0.2 0.50.0 0.1 0.0 0.8 0.1 M 0.4 0.1 0.1 0.0 0.9 0.1 0.3 0.1 0.1 0.0 1.0 0.1 P0.1 0.0 0.8 0.1 1.0 0.1 1.0 0.1 0.2 0.0 1.1 0.1 F 0.3 0.0 0.2 0.0 1.00.1 0.6 0.1 0.1 0.0 1.0 0.1 W 0.2 0.0 0.1 0.0 1.1 0.2 0.7 0.1 0.0 0.01.0 0.1 S 0.4 0.0 0.2 0.0 0.8 0.1 0.6 0.1 0.2 0.0 0.5 0.0 T 0.6 0.1 0.10.0 0.6 0.0 0.5 0.1 0.1 0.0 0.3 0.0 N 0.7 0.1 0.2 0.1 0.9 0.0 0.8 0.10.3 0.1 1.0 0.1 Q 0.6 0.1 0.1 0.0 0.6 0.0 0.2 0.0 0.2 0.0 1.0 0.1 Y 0.20.0 0.1 0.0 0.9 0.1 0.3 0.1 0.1 0.0 1.0 0.1 C″ 0.2 0.0 0.2 0.1 0.9 0.10.2 0.0 0.1 0.0 0.1 0.0 K 0.5 0.1 0.1 0.0 0.3 0.0 0.1 0.0 0.0 0.0 0.90.1 R 0.2 0.1 0.1 0.0 0.1 0.0 0.1 0.0 0.0 0.0 0.7 0.3 H 0.4 0.0 0.1 0.00.5 0.0 0.2 0.0 0.2 0.0 1.0 0.1 D 0.8 0.1 0.2 0.1 1.0 0.0 1.0 0.1 0.00.0 1.0 0.1 E 0.8 0.1 0.2 0.1 1.0 0.0 0.9 0.1 0.0 0.0 1.1 0.1 V S H W QQ Ratio error Ratio error Ratio error Ratio error Ratio error Ratioerror A 0.6 0.0 0.1 0.0 0.3 0.0 1.1 0.1 0.3 0.1 0.1 0.0 G 0.7 0.0 0.50.1 0.7 0.0 1.2 0.2 0.4 0.1 0.2 0.0 V 0.2 0.0 0.2 0.0 0.5 0.0 1.0 0.10.2 0.1 0.1 0.0 IL1 0.1 0.0 0.5 0.0 0.4 0.0 1.0 0.1 0.2 0.1 0.1 0.0 IL20.2 0.0 0.9 0.1 0.2 0.0 1.0 0.0 0.4 0.2 0.1 0.0 M 0.3 0.0 1.0 0.1 0.20.0 1.1 0.1 0.2 0.1 0.2 0.0 P 0.9 0.0 1.0 0.1 0.9 0.1 1.1 0.1 0.9 0.10.1 0.0 F 0.2 0.0 0.9 0.1 0.2 0.0 1.9 0.1 0.1 0.1 0.3 0.1 W 0.2 0.0 0.90.1 0.1 0.0 0.1 0.0 0.1 0.0 0.2 0.0 S 0.3 0.0 0.1 0.0 0.4 0.5 0.2 0.20.2 0.1 0.1 0.0 T 0.2 0.0 0.1 0.0 0.4 0.0 1.1 0.1 0.2 0.1 0.1 0.0 N 0.40.0 0.5 0.1 0.2 0.0 1.2 0.0 0.2 0.1 0.1 0.0 Q 0.4 0.0 0.9 0.1 0.3 0.01.2 0.2 0.3 0.1 0.1 0.0 Y 0.2 0.0 0.9 0.1 0.1 0.0 1.0 0.0 0.1 0.0 0.20.0 C″ 0.3 0.0 0.1 0.0 0.1 0.0 1.1 0.1 0.2 0.1 0.1 0.0 K 0.1 0.0 0.9 0.10.2 0.0 1.1 0.1 0.1 0.1 0.0 0.0 R 0.1 0.0 0.9 0.1 0.1 0.0 1.3 0.2 0.10.0 0.0 0.0 H 0.3 0.0 0.9 0.1 0.2 0.0 1.3 0.2 0.2 0.1 0.1 0.0 D 0.9 0.01.0 0.1 0.4 0.0 1.1 0.1 0.5 0.1 0.0 0.1 E 0.9 0.0 0.8 0.1 0.5 0.0 1.20.1 0.4 0.1 0.0 0.0

Example 4 Generation of a BC2T-Based Capture System

Based on the unusual ligand binding mode of the BC2-Nb it was aimed todevelop a BC2T-based affinity system. Purified BC2-Nb were covalentlycoupled to Sepharose beads and an affinity matrix was generated that isreferred to as “BC2 nanotrap”. First, the ability of the BC2 nanotrap toprecipitate BC2T-lagged proteins directly from crude lysate was tested.Therefore, the soluble protein fractions of E. coli cells expressingeither C-terminally lagged GFP (GFP_(BC2T)) or wtGFP (control) wereincubated with the BC2 nanotrap and the input, non-bound and boundfractions were analyzed by SDS-PAGE followed by coomassie staining andimmunoblotting (FIG. 3 a) The obtained data shows that the BC2 nanotrapquantitatively precipitates GFP_(BC2T). Next, BC2T purification wasperformed in the presence of various non-denaturing detergents (NP-40,Triton X100, CHAPS or Tween 20, 0.1-1% w/v) or increasing saltconcentrations (0-500 mM NaCl, 2-50 mM KCl, 2-20 mM MgCl₂). None ofthese reagents appeared to have an impact on binding efficiency (datanot shown). Additionally, antigen binding was tested under denaturingconditions by raising the concentrations of sodium dodecyl sulfate (SDS)or chaotropic agents (GdmCl; Urea) in the binding buffer. It wasobserved that the BC2 nanotrap efficiently precipitates its antigen inthe presence of 2% SDS, 4 M Urea or up to 1.5 M GdmCl (FIG. 3 b). Thisindicates that the BC2 nanotrap remains functionally active under harshconditions.

Although in some cases such harsh binding and elution conditions mightbe favorable to obtain highly pure protein, most of the bound protein ispresumably denatured and does not maintain biological activity. Hence,more gentle elution conditions were tested using MgCl₂ (0.5 M-4 M),sodium thiocyanate (NaSCN, 1-3 M) or pH-mediated release (acidic; pH1-2.5 or alkaline; pH 10-12). Liberation of bound GFP_(BC2T) was alsotested by competitive elution adding increasing concentrations of BC2peptide (PDRKAAVSHWQQ (SEQ ID NO: 4), 0.01-1 mM). Incubation with MgCl₂does not elute GFP_(BC2T) (data not shown), whereas treatment with highconcentrations of NaSCN or acidic elution (pH 1.5) resulted in therelease of 30%-40% of bound protein (FIG. 3 c, upper panels). Incontrast, alkaline elution using higher pH (pH 11 and 12) revealed amore efficient release of 40-80%. Notably, competitive elution washighly efficient as ˜60% and ˜80% of GFP_(BC2T) was detected in elutionfractions after addition of 0.1 mM and 1 mM BC2 peptide, respectively(FIG. 3 c, lower panel). Moreover, whereas the fluorescence of GFP wasdrastically affected upon treatment with NaSCN or acidic pH, alkaline pHor peptide elution yielded fully fluorescent GFP (FIG. 8). These resultsshow that the BC2 peptide can efficiently displace BC2-bound proteins intheir natively folded state.

The BC2-capture system was further analyzed for one-step purification ofrecombinant proteins derived from human cells Specifically, it wasinvestigated whether the terminal position of the BC2-tag has an impacton binding. To this end, a modified GFP comprising the BC2-tag either onthe N-(_(BC2T)eGFP) or the C-terminus (eGFP_(BC2T)) was expressed inhuman embryonic kidney (HEK) 293T cells using untagged eGFP as anegative control. Two days after transfection soluble protein fractionswere generated and subjected to immunoprecipitation using the BC2nanotrap. Input, non-bound and bound fractions were analyzed by SDS-PAGEfollowed by Coomassie staining and immunoblotting (FIG. 3 d). Theresults show that both N- and C-terminally BC2-tagged GFP constructswere efficiently precipitated by the BC2 nanotrap whereas no GFP wasdetected in the negative control A slightly smaller GFP fragmentappeared in the non-bound fraction of _(BC2T)eGFP. Since this band doesnot appear in the bound lane it is hypothesized that it appears due toprotease-mediated removal of the N-terminal BC2-tag during the cellularlysis procedure.

Next, it was tested whether the BC2 nanotrap can precipitate BC2-taggedcellular components of the intermediate filaments or the nuclearreplication machinery BC2T was genetically fused either tomCherry-vimentin (mCherry-VIM_(BC2T)) or eGFP-PCNA (eGFP-PCNA_(BC2T))and both constructs were expressed in HEK293T cells. As controls,corresponding constructs without the BC2-tag were used. Soluble proteinextracts were incubated with the BC2 nanotrap and whole lysate (input),non-bound and bound fractions were analyzed by immunoblotting usinganti-vimentin or anti-PCNA antibodies. The results show that bothmCherry-VIM_(BC2T) and eGFP-PCNAa_(BC2T) were efficiently precipitatedwith the BC2 nanotrap, while no signal was detectable in the boundfractions of the unlagged proteins (FIG. 9 a). The next question waswhether the BC2-Nb can also detect BC2-tagged proteins inimmunoblotting. Hence, fluorescently labeled BC2-Nb were generated bychemically coupling the purified nanobody to the organic dyeAlexaFluor488 (BC2-Nb_(AF488)) and it was used to probe immunoblots withsoluble protein fractions of cells expressing eGFP-PCNA,eGFP-PCNA_(BC2T), mCherry-VIM or mCherry-VIM_(BC2T). The results showthat BC2-NbA_(AF488) is highly specific for BC2-tagged proteins, whereasuntagged proteins were not detected (FIG. 9 b). Finally, it was askedwhether the BC2-Nb also recognizes endogenous β-catenin in the presenceof BC2-tagged proteins. While no signal for β-catenin was detected inWestern blot with the BC2-Nb_(AF488) (FIG. 9 b) minor amounts ofβ-catenin compared to the overexpressed BC2-tagged proteins were foundin the bound fractions after precipitation with the BC2 nanotrap (FIG. 9c).

In summary, the results show that the BC2T/BC2-Nb affinity systemenables a robust and convenient one-step purification of BC2 taggedrecombinant proteins from different expression systems under both native(i.e. non-denaturing) and denaturing conditions. In combination with theobserved functionality in immunoblot detection, the system covers thefull range of capture and detection of BC2-tagged proteins for a largerange of biochemical analyses.

Example 5 Immunocytochemistry Using Fluorescently Labeled BC2-Nb

Numerous nanobodies have been described for molecular imaging ofdisease-relevant antigens located on cellular surfaces [12-14]. Thereare very few studies so far which have used nanobodies for cellularimaging [15]. Therefore it was tested whether the fluorescently labeledBC2-Nb (coupled to the organic dyes AlexaFluor488 or ATTO647(BC2-NbAF488; BC2-NbATTO647)) is suitable to visualize BC2-taggedcellular proteins. To generate relevant cellular target structures thepreviously described fusion constructs mCherry-VIMBC2T or GFP-PCNABC2Twere expressed in human cells (FIG. 4 a). Fluorescent constructs ofvimentin become incorporated into the cellular intermediate filamentnetwork and visualize vimentin fibers in the cytoplasm upon transientcellular expression [16], whereas GFP-PCNA is found at sites of DNAreplication, forming characteristic spot-like structures in the nucleusduring the S phase of the cell cycle [17]. The characteristic pattern ofthe applied fusion proteins allows the performing of co-localizationstudies using immunocytochemistry with the dye-labeled BC2-Nbs.

By staining of HeLa cells expressing mCherry-VIM_(BC2T) with theBC2-Nb_(AF488) a strong co-localization of the green nanobody signalalong cytoplasmic vimentin structures shown in the red channel (FIG. 4b, FIG. 10 a) was observed. Correspondingly, the BC2-Nb_(ATTO647)revealed a clear co-localization with GFP-PCNA_(BC2T) at replicationfoci during the S phase. The signals of both GFP-PCNA_(BC2T) andBC2-Nb_(ATTO647) were exclusively found in the nucleus, demonstratingthe binding specificity of BC2-Nb. No nanobody signal was detected incells expressing mCherry-VIM or GFP-PCNA constructs lacking the BC2-tag(FIG. 10 b) These data demonstrate that the fluorescently labeled BC2-Nbspecifically binds to ectopically expressed BC2T fusion proteins and cantherefore be applied for direct antigen detection inimmunocytochemistry.

Example 6 Analysis of the Impact of Multiple Mutations or TruncationsWithin the BC2 Tag on the Binding Efficiency of the BC2-Nb in PulldownAssays

As outlined in Example 3 it was demonstrated that BC2T residues atpositions 1, 2, 4, 5, 7, 9, 11 and 12 of SEQ ID NO: 4 can be replaced.This was shown by using positional scanning peptide libraries displayingall 20 proteinogenic amino acids in one single position of the peptide.

To study the impact of multiple exchanges of amino acid residues withinthe BC2T on the binding performance in more detail, a mutated version ofthe BC2T was generated by exchanging six amino acid residuessimultaneously. Hence, Asp (D) at Position 2 to a Val (V), Lys (K) atPosition 4 to a Ser (S); Val (V) at Position 7 to Leu (L); His (H) atPosition 9 to Gln (Q); Gln (Q) at Position 11 to Ser (S) and Gln (Q) atPosition 12 to Ser (S) were replaced. By this in total 50% of the aminoacid residues originally identified as the BC2T sequence (SEQ ID NO:5)were changed. The resulting amino acid sequence is called BC2Tmut. Totest whether the simultaneous exchange of these amino acid residuesaffects the pulldown efficiency of proteins comprising the mutatedBC2-tag (BC2mut), the BC2Tmut was genetically fused to GFP(GFP_(BC2Tmut)). For pulldown analysis the soluble protein fractions ofE. coli cells expressing either wtGFP (control), C-terminally BC2-taggedGFP (GFP_(BC2T)) or GFP_(BC2Tmut) were incubated with the BC2 nanotrapand the input, non-bound and bound fractions were analyzed by SDS-PAGEfollowed by Coomassie staining and immunoblotting (see FIG. 11). Theobtained data shows that the BC2 nanotrap quantitatively precipitatesGFP_(BC2T) as well as GFP_(BC2Tmut). From that it can be concluded thata multiple exchange of amino acid residues at the indicated positionsdoes not affect the interaction and binding capacity of mutated BC2T tothe BC2 nanotrap.

Next, it was analyzed whether N- or C-terminal truncation of the BC2Tdoes affect the binding properties to the BC2-Nb. Therefore twoconstructs were generated. In the first construct (called BC2T-10) thelast two Gln (Q) residues located at the C-terminus of the BC2T weredeleted, resulting in the sequence PDRKAAVSHW (SEQ ID NO: 14). For asecond construct the first two amino acid residues Pro (P) and Asp (D)located on the N-terminus of the BC2T (BC2T-8; RKAAVSHW) wereadditionally deleted. For binding studies the BC2T-10 and the BC2T-8were genetically fused to GFP (GFP_(BC2T-10); GFP_(BC2T-8)). Forpulldown analysis the soluble protein fractions of E. coli cellsexpressing either wtGFP (control), C-terminally BC2-tagged GFP(GFP_(BC2T)), GFP_(BC2T-10) or GFP_(BC2T-8) were incubated with the BC2nanotrap and the input, non-bound and bound fractions were analyzed bySDS-PAGE followed by immunoblotting using anti-GFP antibodies (see FIG.12). The obtained data shows that the BC2 nanotrap precipitatesGFP_(BC2T-10) as well as GFP_(BC2T-8) in a comparable manner as GFPtagged with the original BC2T (GFP_(BC2T)). This demonstrates thatdeletion of the two amino acid residues flanking the BC2T either at theN- or the C-terminus does not affect the interaction and bindingcapacity of the truncated BC2T to the BC2 nanotrap.

Example 7 Development of Improved Variations of BC2T

Using a rational design approach, variations of BC2T were developed withthe aim to further improve the affinity of the BC2 tag to the BC2-Nb. Tothis end, BC2T residues at positions 1, 4, 5, 11 and 12 of SEQ ID NO: 4were substituted based on structural and biochemical data reported inFIG. 1 and Table 4 and on molecular modelling.

Two variations of BC2T, called pTag1 SEQ ID NO:33 and pTag2 SEQ ID NO:34, were designed that were found to display higher affinity to BC2-Nbthan the original BC2T sequence (Table 5, FIG. 18). Using biolayerinterferometry, the dissociation constant K_(D) was determined to be 0.7nM and 1.9 nM for the interaction between BC2-Nb and pTag1 SEQ ID NO: 33and pTag2 SEQ ID NO:34, respectively. In contrast, for original BC2T(SEQ ID NO:4), the K_(D) for the interaction with BC2-Nb is 2.6 nM (asdetermined using biolayer interferometry). In particular, thedissociation rate constant k_(off) is 7× slower for pTag1 SEQ ID NO:33than for original BC2T (Table 5), explaining the unusually high affinityof BC2-Nb for pTag1 SEQ ID NO:34.

These improved variations of BC2T were validated for the application inprotein purification and immunoprecipitation (FIG. 19). The proteinmCherry was fused with the BC2T variations pTag1 SEQ ID NO: 33 and ptag2SEQ ID NO:34 (and also the original BC2T SEQ ID NO:4 for comparison)either at the N-terminus or the C-terminus and incubated with the BC2nanotrap (FIG. 19 a) Subsequently, captured fusion protein was elutedusing the corresponding peptide at a concentration of 100 μM. SDS-PAGEanalysis of fractions of input, non-bound protein, eluted protein andprotein still bound to BC2 nanotrap after elution (FIG. 19 a) shows thatthe improved BC2 tags enable the capture of a fusion proteinirrespective of the localisation of the tag. Also, captured protein maybe eluted using free peptide, i.e. under native, non-denaturingconditions. Thus, the improved variations of BC2T may be applied toone-step protein purification of a protein of interest fused to such aBC2T variation. Compared to original BC2T, the higher affinity of theimproved BC2T variations to BC2-NB leads to a more efficient capture ofthe tagged protein, with up to 100% protein bound (FIG. 19 a).

A protein of interest fused to an improved variation of BC2T may beprecipitated from a range of organisms using the BC2 nanotrap. As isshow in in FIG. 19 b, the N-terminal fusion of pTag1 SEQ ID NO.33 to theprotein mCherry allows the highly specific recovery of this fusionprotein from lysates of the human cell line HEK293T, the Trichoplusia niinsect cell line High5 and the yeast Saccharomyces cerevisiae using theBC2 nanotrap. This experiment (FIG. 19 b) uses cell lines common in theart, which exemplifies the applicability of the BC2 nanotrap inconjunction with protein-tagging icing BC2T variation pTag1 SEQ ID NO:33to protein purification and precipitation from relevant systems ofrecombinant expression.

TABLE 5 Binding kinetics of two improved variations of BC2T compared towildtype BC2T K_(D) Sequence (nM) k_(on) (M⁻¹s⁻¹) k_(off) (s⁻¹) χ² R²pTag1 0.7 9.2 ± 0.5 × 10⁴ 6.8 ± 0.4 × 10⁻⁵ 0.09 0.9997 pTag2 1.9 1.1 ±0.1 × 10⁵ 2.0 ± 0.1 × 10⁻⁴ 0.17 0.9992 BC2T 2.6 1.9 ± 0.1 × 10⁵ 5.0 ±0.1 × 10⁻⁴ 0.18 0.9987

Table legend: The binding of BC2-Nb to BC2T (SEQ ID NO:4) and the twoimproved variations (pTag1 SEQ ID NO:33 and pTag2 SEQ ID NO:34) thereofwas analysed using biolayer interferometry (see FIG. 13 for raw data).Listed are the dissociation constant (K_(D)), the association (k_(off))and dissociation (k_(off)) rate constants and markers of the quality offit of a 1:1 binding model (χ², R²). The constants reported for originalBC2T differ from those listed in Table 1 because kinetics were analysedusing a different method.

In summary, a rational design approach enabled the development ofimproved variations of BC2T that are characterised by higher affinity toBC2-Nb. These improved variations may be fused to the N-terminus orC-terminus of a protein of interest and allow the efficient purificationof such a fusion protein from a variety of organisms. Also, likeoriginal BC2T, these improved BC2T tags can be eluted of the BC2nanotrap using native, non-denaturing conditions.

Example 8 Materials and Methods

Expression Plasmids

For bacterial expression of C-terminal BC2-tagged GFP (GFP_(BC2T))pEGFP-C1 (Clontech) was used as template. The sequence encodingGFP_(BC2T) was amplified by polymerase chain reaction (PCR) using theoligonucleotide primers GFP_(BC2T_)for (5′-GCA CCA TOG ATG GTG AGC AAGGGC GAG G-3′; SEQ ID NO:15) and GFP_(BC2T_)rev (5′-GAC GTC GAC TTA CTGCTG CCA GTG ACT AAC A-3′; SEQ ID NO:16). The PCR fragment was clonedinto the NcoI/SaII restriction sites of pTRC2A (Life Technologies). Forexpression of the C-terminal BC2-tagged GFP with a N-terminal His₆-tag(His₆-GFP_(BC2T)) the sequence encoding GFP_(BC2T) was amplified by PCRusing the oligonucleotide primers His₆-GFP_(BC2T_)for (5′-CAG GGA TCCGAG TGA GCA AGG GC-3′; SEQ ID NO:17) and His₆-GFP_(BC2T_)rev (5′-CAG GGTACC TTA CTG CTG CCA GTG ACT AA-3′; SEQ ID NO:18). The PCR fragment wascloned into BamHI/KpnI restriction sites of pRSET B (Invitrogen) addingan N-terminal His₆-tag.

To generate a GFP fusion construct comprising a 50% modified BC2tagpEGFP-C1 (Clontech) was used as template. The sequence encodingGFP_(BC2Tmut) was amplified by polymerase chain reaction (PCR) using theoligonucleotide primers GFP_(BC2Tmut_)for (5′-GGA TCC GAT GOT GAG CAAGGG CGA G-3′; SEQ ID NO:19) and GFP_(BC2Tmut_)rev (5′-GGT ACC TTA GOTOCT CCA CTG GOT CAG CGC CGC GCT CCG GAC COG CTT GTA CAG CTC GTC CATGGV3′; SEQ ID NO:20). The PCR fragment was cloned into the BamHI/KpnIrestriction sites of the previously generated GFP_(BC2T) expressionconstruct.

To generate a bacterial expression vector encoding GFP with a N-terminalHis₆-tag and a C-terminal shortened version of the BC2T (BC2T-10 orBC2T-8) the sequence encoding GFP_(BC2T-10) or GRP_(BC2T-8) wasamplified by PCR using pEGFP-C1 (Clontech) as template. For PCR thefollowing oligonucleotide primers were used. His₆-GFP_(BC2T-10_)for(5′-cccc GGA TCC GAT GGT GAG CAA GGG CGA GG-3′; SEQ ID NO:21) andHis₆-GFP_(BC2T-10_)rev (5′-cccc GGT ACC TTA CCA ATG TGA CAC CGC TCC TTTGCG GTC AGG CTT GTA CAG CTC GTC CAT GCC-3′; SEQ ID NO:22) orHis₆-GFP_(BC2T-8_)for (5′-cccc GGA TCC GAT GGT GAG CAA GGG CGA GG-3′;SEQ ID NO:23) and His₆-GFP_(BC2T-8_)rev (5′-cccc GGT ACC TTA CCA GTG GGAAAC GGC TGC TTT ACG CTT GTA CAG CTC GTC CAT G-3′; SEQ ID NO:24).

For mammalian expression of C-terminal BC2-tagged GFP pEGFP-C1 was usedas template. The sequence encoding the mammalian eGP_(BC2T) constructwas amplified by PCR using the oligonucleotide primers eGFP_(BC2T_)for(5′-AAG CTA GCG CTA CCG GTC GCC ACC ATG-3′; SEQ ID NO:25) andeGFP_(BC2T_)rev (5′-AAG GTA CCT TAT TGC TGC CAG TGA CTA AC A GCC GCT TTTCTG TCT GGC TTG TAC AGC TCG TC-3′; SEQ ID NO:26). The PCR fragment wascloned into the NheI/KpnI site of the pEGFP-C1 vector.

For mammalian expression of N-terminal BC2-tagged GFP (_(BC2T)GFP) thenucleotide sequence encoding the BC2-Tag (5-GCT AGC ATG CCC GAT COT AAGGCT GCG GrC TCT CAT TGG CAA CAG AGA TCT-3′; SEQ ID NO:27) harboring NheIand BglII restriction sites respectively was synthesized (MWG).Subsequently the tag was cloned into NheI/BglII sites of pEGFP-Nl(Clontech). The obtained plasmid was digested with XhoI and NheI,blunted using the Klenow enzyme (Roche) and re-ligated resulting in thedesired construct.

For mammalian expression of mCherry-Vimentin_(BC2T) a mCherry-Vimentinconstruct [18] was used as template. The nucleotide sequence encodingmCherry-Vimentin_(BC2T) was amplified by PCR using the oligonucleotideprimers mCherry-Vimentin_(BC2T_)for (5′-AAA AGC TTA GGT GGA GGA GGT TCTTCC ACC AGG TCC GTG TC-3′; SEQ ID NO:28) and mCherry-Vimentin_(BC2T_)rev(5′-AAG GTA CCC TAT TGC TGC CAG TGA CTA AC A GCC GCT TTT CTG TCT GGT TCAAGG TCA TCG TG-3′; SEQ ID NO:29). The PCR fragment was cloned into theHindIII/KpnI sites of the mCherry-Vimentin vector.

For mammalian expression of GFP-PCNA_(BC2T) GFP-PCNA [17] was used astemplate. The sequence encoding GFP-PCNA_(BC2T) was amplified by PCRusing the oligonucleotide primers GFP-PCNA_(BC2T_)for (5′-GTA TCK) CTTCGT GGG GAT CCC CG-3′; SEQ ID NO:30) and GFP-PCNA_(BC2T_)rev (5′-GGG GTCTAG ACT AAA GGT ACC CTA TTG CTG CCA GTG ACT AAC AGC CGC TTT TCT GTC TGGAGA TCC TTC TTC ATC CTC-3′; SEQ ID NO:31). The PCR fragment was clonedinto the BamHI XbaI restriction sites of GFP-PCNA vector.

To generate a bacterial expression vector encoding mCherry with anN-terminal His₆-tag and a C-terminal BC2T or variations thereof, thesequences encoding BC2T (SEQ ID NO:4) and the variations pTag1 SEQ IDNO:33 and pTag2 SEQ ID NO:34 were synthetized as oligonucleotides,annealed and cloned into the BrsGI and HinIII restriction sites of theplasmid vector pRSFf-B_mCherry (ThermoFisher Scientific). To this end,following oligonucleotides were used: mCherry_BC2T_fw (5′-GTA CAG TGGTCC GGA TCG CAA AGC GGC GGT GAG CCA TTG GCA GCA GTA AA-3′; SEQ ID NO:35)and mCherry_BC2_rv (5′-AGC TTT TAC TGC TGC CAA TGG CTC ACC GCC GCT TTGCGA TCC GGA CCA CT-3′; SEQ ID NO:36); mCherry_SEQ33_fw (5′-GTA CAG TGGTCC GGA TCG CGT GCG CGC GGT GAG CCA TTG GAG CAG CTA AA-3′; SEQ ID NO:37)and mCherry_SEQ33 rv (5′-AGC TTT TAG CTG CTC CAA TGG CTC ACC GCG CGC ACGCGA TCC GGA CCA CT-3′; SEQ ID NO:38): mCherry_SEQ34_fw (5′-GTA CAGT GGTGCG GAT CGC GTG CGC GCG GTG AGC CAT TGG AGC AGC TAA A-3′; SEQ ID NO:39)and mCherry_SEQ34_rv (5′-AGC TTT TAG CTG CTC CAA TGG CTC ACC GCG CGC ACGCGA TCC GCA CCA CT-3′; SEQ ID NO:40).

To generate a bacterial expression vector encoding mCherry with aC-terminal His₆-tag and an N-terminal BC2T or variations thereof, thesequences encoding BC2T (SEQ ID NO:4) and the variations pTag1 SEQ IDNO:33 and pTag2 SEQ ID NO:34 were synthetized as oligonucleotides,annealed and cloned into the NdeI and BamHI restriction sites of amodified variation of the plasmid vector pRSET-B_mCherry (ThermoFisherScientific) that encodes a C-terminal His₆-tag. To this end, followingoligonucleotides were used: mCherry_BC2T_fw2 (5′-TAT GCC GGA TCG CAA AGCGGC GGT GAG CCA TTG GCA GCA GGG CTC G-3′; SEQ ID NO:41) andmCherry_BC2T_rv2 (5′-GAT CCG AGC CCT GCT GCC AAT GGC TCA CCG CCG CTT TGCGAT CCG GCA-3′; SEQ ID NO:42); mCherry_SEQ33_fw2 (5′-TAT GCC GGA TCG CGTGCG CGC GGT GAG CCA TTG GAG CAG CGG CTC G-3′; SEQ ID NO:43) andmCherry_SEQ33_rv2 (5′-GAT CCG AGC CGC TGC TCC AAT GGC TCA CCG CGC GCACGC GAT CCG GCA-3′; SEQ ID NO:44); mCherry_SEQ34_fw2 (5′-TAT GGC GGA TCGCGT GCG CGC GGT GAG CCA TTG GAG CAG CGG CTC G-3′; SEQ ID NO:45) andmCherry_SEQ34_rv2 (5′-GAT CCG AGC CGC TGC TCC AAT GGC TCA CCG CGC GCACGC GAT CCG CCA-3′; SEQ ID NO:46).

Protein Production and Purification

Expression and purification of the BC2 nanobody was performed asdescribed previously [11]. Expression and purification of wtGFP ormodified versions thereof was performed as described previously [19].Purity of all proteins was evaluated to be at least 95% based uponsodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE)analysis. Protein concentration was spectroscopically determined.

Complex Formation

For complex formation, a BC2-Nb solution at 2 mg/ml was mixed andincubated with a threefold molar excess of peptide in 10 mM Tris/HCl pH7.4, 100 mM NaCl buffer for 1 h at room temperature. Excess of peptidewas removed via size exclusion chromatography (Superdex 200 increase, GEHealthcare). Complex formation was confirmed by liquid chromatographymass spectrometry using a Shimadzu LCMS 2020 with a Phenomenex Kinetex(2.6 u C18 100 Å) column. The complex was concentrated to 13.7 mg/ml andused for crystallization experiments.

Crystallization

Unliganded BC2-Nb was crystallized using hanging drop vapor diffusion bymixing 3.2 mg/ml protein solution with crystallization buffer (0.1 MMES/imidazole pH 6.7, 12.5% [v/v]2-Methyl-2,4-pentanediol (MPD), 7.5%(w/v) PEG 1000, 7.5% [w/v] PEG 3350, 3 mM alcohol mix) in a 1:1 ratio at20° C. The BC2-Nb BC2T complex was crystallized using hanging drop vapordiffusion, mixing 13.7 mg/ml complex solution with crystallizationbuffer (0.1 M MES/imidazole pH 6.5, 12.5% [v/v] 2-Methyl-2,4-pentanediol(MPD), 12.5% [w/v] PEG 1000, 12.5% [w/v] PEG 3350, 4 mM amino acid mix)in a 1:1 ratio at 20° C. In both cases, crystals were transferred intocrystallization solution containing 30% [v/v] MPD for cryoprotection,and flash cooled in liquid nitrogen after incubation for 30 s. OneDataset from BC2-Nb and four dataset from the same crystal of BC2-NbBC2T complex at different positions were collected with a beamwavelength of 0.918409 Å at beamline MX 14.2 of BESSY II at theHelmholtz-Zentrum Berlin (HZB).

X-ray data were reduced and in the case of BC2-Nb BC2T complex, mergedusing the XDS package [20]. Initial phases for the BC2-Nb data wereobtained by molecular replacement using PHASER [21] with a CHAINSAW[22.23] modified model of a nanobody (PDB ID:2X1O) containing only thecore region. The structure of the BC2-Nb BC2T complex was then solvedusing the unliganded BC2-Nb structure as a search model in molecularreplacement. Both structures were refined using PHENIX refine [24],REFMAC5 [25] and COOT [26]. The two structures were validated withMOLPROBITY [27]. The Ramachandran plot shows 100% (BC2-Nb), 98.1%(BC2-Nb/BC2T complex) in favored and 100% in allowed regions.

Structure Visualization and Analyzation

Superpositions of structures and calculation of RMSD values wereconducted using the Phenix structure comparison. Images of crystalstructures were prepared with PyMol.

Mass Spectrometry Analysis of Binding Specificities with SyntheticPositional Scanning Peptide Libraries

To investigate the binding specificity of the BC2T and derived mutantspeptide libraries for each amino acid position of the sequencePDRKAAVSHWQQ (SEQ ID NO:4) were synthesized with acetyl and amide groupslocated at the N-termini or C-termini respectively (Intavis).Precipitation studies were carried out incubating 60 pmol peptide of asingle position library with 2 μl BC2-Nb immobilized on agarose beads.Incubations were performed in 300 μl PBS/0.01% CHAPS for 1 h on aHulaMixer (Life Technologies). Subsequent to a centrifugation step 10 μlsupernatant were analyzed in a LC-MS procedure. Peptides were separatedusing an UltiMate3000 RSLCnano System (Thermo Scientific), composed of aC18 PepMap100 μ-Precolumn (300 μm×5 mm; particle size: 5 μm; pore size100 Å—Thermo Scientific) and a C18 analytical column (Acclaim RapidSeparation LC (RSLC) Column: 150 mm×5 mm; particle size: 2.2 μm; poresize: 100 Å—Thermo Scientific). A step gradient was applied starting at8% and ending after 20 min at 30% eluent B (80% acetonitrile, 20% H₂O,0.1% formic acid). Peptides were analyzed using a FULL-MS-strategydetected by a Q Exactive Plus mass spectrometer (Thermo Scientific). Asmaximal injection lime 100 ms was chosen while setting the AGC target to3E6. The resolution was set to 70.000. Half-maximal signal areas werereferenced to control precipitation approaches using a GFP-specificnanobody immobilized on agarose beads. All experiments were done intriplicates.

Cell Culture and Transfection

HEK293T and HeLa cells were cultivated in DMEM (high glucose, pyruvate)supplemented with 10% FCS, 2 mM L-glutamine and Pen Strep (all fromGibco, Life Technologies). Cells were cultivated at 37° C. in ahumidified chamber with a 5% CO₂ atmosphere and were trypsinized forpassaging. To generate DNA/PEI complexes for transient transfection inP100 dishes, 24 μg DNA were mixed with 108 μl polyethyleneimine (PEI,Sigma Aldrich) prediluted with 750 μOpti-MEM (Gibco, Life Technologies)and incubated for 10 min at RT. For transfection of cells in a 96-wellplate, 200 ng of DNA and 1.5 μl of PEI was used.

Generation of Soluble Protein Fraction from Bacterial Cells

Pellets of E. coli cells expressing GFP or GFP_(BC2T) derived from 1 Lculture were homogenized for 90 min at 4° C. in 500 μl PBS containing0.1 mg/ml lysozyme, 5 μg/ml DNaseI, 50 μg/ml PMSF and 1× proteaseinhibitor mix B (Serva) followed by sonication (10×10 sec pulses) Aftera centrifugation step (10 min at 18.000×g. 40° C.), the soluble proteinfraction was transferred into a new cup and the protein concentration ofeach lysate was determined using Coomassie Plus according to themanufacturer's protocol (Thermo Fisher Scientific).

Surface Plasmon Resonance

The affinity measurements of the BC2 nanobody and the indicated mutantsthereof to GFP_(BC2T) were performed using surface plasmon resonancespectroscopy with a Biacore 3000 instrument (GE-Healthcare). GFP_(BC2T)was covalently coupled on dextran fibers of a CM5 sensorchip (GEHealthcare) to a response level of 500 RU. One flow cells was activatedand blocked in the absence of protein to determine background, anotherwas loaded with untagged GFP as a control against unspecific binding.For kinetic measurement, six concentrations ranging from 7.8125 nM to250 nM of either BC2-Nb, BC2-Nb_(R106S) or BC2-Nb_(R106E) were injected.Each measurement was done in duplicates. As running dilution buffer 10mM HEPES pH 7.4, 150 mM NaCl, 0.5% surfactant Tween was used.Measurements were performed at 25° C. For the association of BC2-Nb aHow rate of 50 μl/min for 15 s and for the dissociation a flow rate of50 μl/min for 600 s was applied. The regeneration was induced byinjection of 23 μl regeneration solution at a flow rate of 30 μl/min. Asregeneration solution 10 mM glycine-HCl pH 2.0 was used. The data wasevaluated using the software Bia evaluation 4.1 and the 1:1 Langmuirbinding model with mass transfer.

SDS-PAGE and Immunoblotting

Denaturing polyacrylamide gel electrophoresis (SDS-PAGE) was performedaccording to standard procedures Protein samples were boiled in 2×SDS-sample buffer (60 mM Tris/HCl, pH 6.8; 2% (w/v) SDS; 5% (v/v)2-mercaploethanol, 10% (v/v) glycerol, 0.02% bromophenole blue). Forimmunoblotting proteins were transferred on nitrocellulose membranes(Bio-Rad Laboratories).

Antibodies

For immunoblotting the following primary antibodies were used: anti-GFPclone 3H9 (ChromoTek), anti-PCNA clone 16D10 (ChromoTek), anti-Vimentinclone V9 (Sigma Aldrich), anti-GAPDH (abeam), anti-β-eaten in clone 14(BD-Biosciences). For detection fluorophore-labeled species-specificsecondary antibodies (Alexa-647, goat-anti-rabbit, goat-anti-rat;goat-anti-mouse Life Technologies) were used. Blots were scanned on theTyphoon-Trio laser scanner (GE Healthcare).

Immunoprecipitation of BC2-Tagged Proteins

HEK293T cells transiently expressing eGFP, eGFP_(BC2T) or _(BC2T)eGFP,eGFP-PCNA, eGFP-PCNA_(BC2T), mCherry-Vimentin, mCherry-Vimentin_(BC2T)were washed and harvested in phosphate buffered saline (PBS),snap-frozen in liquid nitrogen and stored at −20° C. Cell pellets werehomogenized in 200 μl lysis buffer (10 mM Tris/Cl pH 7.3, 150 mM NaCl,0.5% NP40, 1 μg DNaseI, 2 mM MgCl₂, 2 mM PMSF, 1× protease inhibitor mixM (Serva) by repealed pipetting for 40 min on ice. After acentrifugation step (10 min at 18.000×g) the protein concentration ofeach lysate was determined using a Pierce BCA Protein Assay Kit (ThermoFisher Scientific) according to manufacturer's protocol and the proteinsolutions were adjusted with dilution buffer (10 mM Tris/Cl pH 7.5, 150mM NaCl, 2 mM PMSF) to a concentration of 1 mg/ml, 2% of the supernatantwere added to SDS-containing sample buffer (referred to as input). 50 μlBC2 nanotrap (slurry) per 100 μl supernatant were added followed byincubation for 1 h on an end-over-end rotor at 4° C. After acentrifugation step (2 min, 2500×g) the precleared supernatant wastransferred to a new cup. 2% were added to SDS-containing sample buffer(referred to as non-bound). After four washing steps, BC2 nanotrap boundproteins were eluted by boiling the beads in 50 μl SDS-sample buffer orby incubation with indicated elution buffers. Samples were analyzed bySDS-PAGE followed by immunoblotting. Immunoblots were probed withindicated antibodies

Immunoprecipitation at Harsh Conditions

For each condition, 50 μl bead-slurry were mixed with 100 μl of solubleprotein fraction from E. coli lysate (c=1 mg/ml) and 150 μl of asolution containing either a detergent or a chaotropic agent, resultingin a final concentration as indicated below. As chaotropic agents urea(0 M, 1 M, 2 M, 4 M) and guanidinium chloride (0 M, 0.375 M, 0.75 M, 1.5M, 3 M) were used and as a detergent SDS (0%, 0.1%, 0.5%, 1%, 2%) wasused. After incubation on an end-over-end rotor for one hour at 4° C.and a centrifugation step (2 min, 2500×g, 4° C.) the supernatants werediscarded and the remaining beads were washed twice in PBS beforeboiling in 50 μl 2×sample buffer. 10% of each bead bound (B) fractionwere analyzed by SDS-PAGE and immunoblotting using an anti-GFP antibody.

Elution of Bound BC2-Tagged Protein

For the elution experiments 40 μl of BC2 nanotrap (slurry) wereincubated with 400 μl soluble protein extract (c=1 mg/ml) derived fromE. coli cells expressing GFP_(BC2T) for 1 h at 4° C. on an end-over-endrotor. After a centrifugation step (2500×g, 4° C., 2 min) thesupernatant was discarded and the beads were washed four times inice-cold PBS including a cup change after the second washing step.Subsequently, beads were pelleted and incubated with 80 μl of theindicated elution conditions for 15 min at RT. The following elutionconditions have been tested, peptide elution 0 mM, 0.01 mM. 0.1 mM or 1mM BC2-peptide dissolved in in 0.2 M Tris/Cl pH 7.4, 150 mM NaCl; acidicelution: 0.2 M Glycine-HCl adjusted to pH 1. pH 2 or pH 3; alkalineelution: 0 mM. 1 mM, 10 mM or 100 mM NaOH; elution with sodiumthiocyanate. 0 M, 1 M, 2 M, or 3 M NaSCN. The eluates were collected andboiled in 1×SDS containing sample buffer and the remaining beads werewashed twice in PBS before boiling in 40 μl 2× sample buffer. 10% ofeach elution (release, R) and bead bound (bound, B) fraction wereanalyzed by SDS-PAGE and immunoblotting using an anti-GFP antibody.

Immunofluorescent Staining with Fluorescently Labeled Nanobody

Purified BC2 nanobody (1 mg) was labeled with the NHS-activatedfluorescent dyes Alexa488 (Life technologies) or Atto647 (Atto-Tec)according to manufacturer's guidelines. After coupling, unbound dye wasremoved by separation on PD-10 Desalting Columns (GE Healthcare). Forimmunoblot analysis. 5 μg of BC2-Nb_(AF488) was diluted in 4 ml 3% BSAdiluted in TBS, 0.05% Tween20 and Western blots were incubated for 1 hat RT.

For immunocytochemistry 5-10×10³ adherent HeLa cells per well of aμClear 96 well plate (Greiner) were transfected with plasmids coding forC-terminally BC2-tagged vimentin or PCNA After 24 h, cells were washedonce with PBS and fixed with 4% w/v paraformaldehyde (PFA) in PBS for 15mm at RT or with methanol for 15 mm at −20° C. After removal of thefixative and washing twice with PBS, cells were blocked andpermeabilized with 3% w/v BSA and 0.1% v/v Triton X-100 in PBS for 1 hat RT Subsequently, labeled BC2 nanobody was added with a finalconcentration of 10-15 μg/ml and 4′,6-diamidino-2-phenyl indole (DAPI,Sigma Aldrich) with a final concentration of 1 μg/ml for overnightincubation at 4° C. Unbound nanobody and DAPI were removed by washing 3limes with a mixture of PBS and 6% v/v 5 M NaCl in H₂O. Images of FIG. 4b and FIG. 10 b were acquired with an Image Xpress micro XL system.Confocal images of FIG. 10 a were acquired with an ImageXpress MicroConfocal.

Fluorescence Spectroscopy

Fluorescence assay s were performed by scanning a 96 well microplate(Nunc) on a Typhoon Trio (GE Healthcare Life Sciences), excitation: 488nm, emission filter settings: 520 nm BP 40

Biolayer Interferometry

The affinity measurements of BC2-Nb to BC2T and improved variationspTag1 and pTag2 were performed using biolayer interferometry using aBLItz system instrument (fortéBIO). Biotinylated peptide of BC2T and itsvariations were immobilised on Streptavidin biosensors (fortéBIO). Theloaded sensor was sequentially dipped into BLItz buffer (phosphatebuffered saline, 0.1% (m/v) BSA, 0.02% (v/v) Tween20) for 120 s todetermine base line signal, in a solution of BC2-Nb in BLItz buffer(concentration of 5-50 nM) for 120 s to determine association and inBLItz buffer for 600 s to determine dissociation. The shaking rate was1200 rpm. As a reference, each peptide was also incubated with BLItzbuffer only according to the protocol above. In a control experiment,Streptavidin biosensors were incubated with BC2-Nb without prior peptideimmobilisation. The sensor was regenerated by sequential dips into 10 mMglycine pH 2.0 and BLItz buffer (repeated three times). Raw data wereanalysed using the software BLItz Pro (fortéBio) and a 1:1 bindingmodel.

Example 9 BC2 Nanobody Generation

Identification of BC2-Nanobody

Material & Methods

V_(H)H Libraries

Alpaca immunizations with purified β-catenin protein and V_(H)H-libraryconstruction were carried out as described previously in Rothbauer etal., (2006) (Targeting and tracing antigens in live cells withfluorescent nanobodies. Nature methods 3, 887-889). Animal immunizationhas been approved by the government of Upper Bavaria (Permit number:55.2-1-54-2531.6-9-06). In brief, six weeks after immunization of twoanimals (Vicugna pacos) with either GST-β-catenin or C-terminalhistidine-tagged β-catenin (β-catenin-His₆), ˜100 ml blood werecollected and lymphocytes were isolated by Ficoll gradientcentrifugation using the Lymphocyte Separation Medium (PAA LaboratoriesGmbH) Total RNA was extracted using TRIzol (Life Technologies) and mRNAwas reverse transcribed to cDNA using a First-Strand cDNA Synthesis Kit(GE Healthcare). The V_(H)H repertoire was isolated in 3 subsequent PCRreactions using following primer combinations (I) CALL001 (5′-GTC CTGGCT GCT CTT CTA CA A GG-3′; SEQ ID NO:47) and CALL002 (5′-GGT ACG TGCTGT TGA ACT GTT CC-3′; SEQ ID NO:48) (2) SM017 and SM018 (5′-CCA GCC GGCCAT GGC TCA GGT GCA GCT GGT GGA GTC TGG-3′; SEQ ID NO:49, and 5′-CCA GCCGGC CAT GGC TGA TGT GCA GCT GGT GGA GTC TGG-3′; SEQ ID NO:50,respectively) and reverse primer CALL002 and (3) A4short (5′-CAT GCC ATGACT CGC GGC CAC GCC GGC CAT GGC-3′; SEQ ID NO:51) and reverse Primer 38(5′-GGA CTA GTG CGG CCG CTG GAG ACG GTG ACC TGG GT-3′; SEQ ID NO:52)introducing SfiI and NotI restriction sites. The V_(H)H library wassubcloned into the SfiI/NotI sites of the pHEN4 phagemid vector (forreference see Arbabi Ghahroudi, et al., (1997) Selection andidentification of single domain antibody fragments from camelheavy-chain antibodies. FEBS Lett 414, 521-526)

V_(H)H Screening

The V_(H)H domains were expressed on phages after infecting the cells ofthe ‘immune’ library in pHEN4 with M13K07 helper phages. V_(H)H withspecificity for β-catenin were enriched by two consecutive rounds of invitro selection using full-length β-catenin coated on microtiter plates(10 μg well). Bound phages were eluted with 100 mM tri-ethylamine, TEA(pH 10.0). The eluate was immediately neutralized with 1 M Tris/HCl (pH7.4) and used to infect exponentially growing TG1 cells. The enrichmentof phage particles carrying the antigen-specific V_(H)H domains wasmonitored by comparing the number of phages eluted from wells withcaptured versus non-captured antigen. Following panning 96 individualclones of each antigen were screened by standard ELISA procedures usinga horseradish peroxidase-labeled anti-M13 monoclonal antibody(GE-Healthcare)

Expression Plasmids

For bacterial expression of V_(H)H domains (nanobodies, Nbs), sequenceswere cloned into the pHEN6 vector (for reference see Arbabi Ghahroudi etal., (1997) Selection and Identification of single domain antibodyfragments from camel heavy-chain antibodies. FEBS Lett 414, 521-526),thereby adding a C-terminal 6×His-tag for IMAC purification as describedpreviously (for reference see Rothbauer et al. (2008) A versatilenanotrap for biochemical and functional studies with fluorescent fusionproteins. Molecular & cellular proteomics: MCP 7, 282-289). Expressionand purification of β-catenin-specific Nbs was earned out as describedpreviously in Rothbauer et al., (2008) Molecular & cellular proteomics:MCP 7, 282-289).

The expression plasmids for the ββ-catenin fusions have been describedpreviously in Aberle et al. (1994) Assembly of the cadherin-catenincomplex in vitro with recombinant proteins. Journal of cell science 107(Pt 12). 3655-3663, and Luckert et al., (2011) Snapshots of proteindynamics and post-translational modifications in oneexperiment-beta-catenin and its functions. Molecular & cellularproteomics: MCP 10, M110 007377). For protein production, E. coliBL21(DE3) CodonPlus-RIL cells (Stratagene) were used.

Microsphere-Based Sandwich Immunoassays (see FIG. 13)

BC2-Nb was covalently immobilized on microspheres using a modifiedmanufacturer's protocol described in Poetz et al. ((2009)Microsphere-based co-immunoprecipitation in multiplex. Analyticalbiochemistry 395, 244-248). Immobilized BC2-Nb was incubated withpurified proteins representing full-length β-catenin or the followingdomains of β-catenin: N-terminus aa 1-119, armadillo domain aa 120-683and C-terminus aa 683-781. Purified Glutathion-S-transferase (GST) wasused as a negative control. Concentrations range from 0.25 μg/ml to 2μg/ml. Bound proteins were detected with domain-specific antibodiesincluding ABC antibody clone 8E7 (Merck Millipore) specific for theN-terminus, anti-β-catenin 9G10 (Merck Millipore) targeting thearmadillo domain and anti-β-catenin clone 14 (Becton Dickinson)specifically recognizing the C-terminus. To detect unspecific binding ofthe BC2-Nb to GST an antibody against GST (6G9, ChromoTek) was used.Secondary species-specific antibodies (anti-mouse, anti-rat,anti-rabbit) tagged with phycoerythrin (PE) (Dianova) were used forimmunodetection of antibody protein complexes. The microspheres weremeasured in a FlexMap 3D instrument (Luminex). The evaluation of thedata was done with Excel (Microsoft).

Surface Plasmon Resonance

The affinity measurement of the BC2-Nb was performed using surfaceplasmon resonance spectroscopy with a Biacore 3000 instrument(GE-Healthcare). Recombinant β-catenin was covalently coupled on dextranfibers of a CM5-chip (GE-Healthcare) according to the manufacturer'sprotocol. β-catenin was coupled to a response ranging from 1200 to 3000RU. One flow cells was activated and blocked in the absence of proteinto determine background. For kinetic measurement five concentrationsranging from 0.625 nM to 5 μM of the BC2-Nb were injected Eachmeasurement was done in duplicates. As running/dilution buffer 10 mMHEPES pH 7.4, 150 mM NaCl, 0.5% surfactant P20 was used. Measurementswere performed at 25° C. For the association of the BC2-Nb a flow rateof 30 μl/min for 3 minutes and for the dissociation a flow rate of 60μl/min for 5 minutes was applied. The dissociation was induced by twoinjections of regeneration solution of 15 μl each at the identical flowrates. As regeneration solution was used BC2: 100 mM H₃PO₄. The data wasevaluated using the software Bia evaluation 4.1 and the 1:1 Langmuirbinding model.

Immobilization of Peptides onto OVA-Microspheres

MagPlex microspheres (Luminex), with varying IDs per peptide (n=29),were coated with Imject ovalbumin (OVA) (Thermo Fisher Scientific).Coating procedure was done according to a modified manufacturer'sprotocol (see Poetz et al., ((2009) Microsphere-basedco-immunoprecipitation in multiplex. Analytical biochemistry 395,244-248). Subsequently, 250,000 microspheres of each ID were washed onetime with 1× PBS, the OVA was activated with 1660 μM sulfosuccininadyl4-[p-maleinadophenyl] butyrate (suIlfo-SMPB) (Thermo Fisher Scientific)dissolved in dimethyl sulfoxide (Roth) and diluted to the finalconcentration with 1× PBS. The activation of OVA-microspheres was donefor 1 h at RT on a plate incubator (Eppendorf) at 650 rpm. In parallel,the 29 peptides were reduced with tris(2-carboxyethyl)phosphine (TCEP).Each peptide was diluted in 1× PBS containing 40% (v/v) acetonitrile toa final concentration of 600 μM. An equimolar solution of TCEP in PBSwas prepared and one volume of each the peptide solutions and theTCEP-solution were mixed and incubated for 20 mm at RT at 250 rpm on aplate incubator. After activation, the OVA-microspheres were washed twotimes with 1× PBS 0.005% Triton X-100 to remove excess sulfo-SMPB andsolubilized in reduced peptide solutions. Coupling procedure of thepeptides to OVA was done for 1 h at RT with continuous shaking at 650rpm. Afterwards, the microspheres were again washed two times with 1×PBS 0.005% Triton X-100 and transferred into a blocking buffercontaining 10 mg/ml BSA dissolved in 1× PBS to block all free activatedOVA. The blocking of OVA was done for 10 min at room temperature at 850rpm followed by an additional washing step with 1× PBS containing 0.005%Triton and transferred into 1× Roche-buffer (Roche) containing 0.05%sodiumazid for storage.

Epitope Mapping

For the epitope mapping, BC2-Nb was biotinylated withsulfo-NHS-LC-biotin (Thermo Fisher Scientific) using manufacturer'sprotocol. The biotinylated BC2-Nb was then used as detection reagent ina sandwich-immunoassay similar to the domain mapping procedure using thepeptide coaled OVA-microspheres to screen for specific epitopes. Asnegative controls, microspheres comprising Ovalbumin and the Myc-peptide(EQKLISEEDL) covalently coupled to OVA-microspheres were used. Thebiotinylated BC2-Nb was applied at a concentration of 1 μg/ml.Peptide-bound BC2-Nb was detected with 2.5 μg-mlstreptavidin-phycoerythrin (PE) solution (Prozyme) dissolved in 1× Rochebuffer 0.05% Tween. Evaluation of the PE-signals w ere done in a Flexmap3D (Luminex).

Results

To generate β-catenin-specific nanobodies, two alpacas (Vicugna pacos)were immunized with purified recombinant human β-catenin. A phagemidlibrary was generated comprising ˜2×10⁷ clones representing the fullrepertoire of the variable heavy chains of heavy chain antibodies(V_(H)Hs or nanobodies, Nbs) derived from one animal. The library wassubjected to phage display and biopanning was performed usingfull-length β-catenin. Two subsequent phage display cycles revealed anenrichment of 14 unique nanobody sequences which were positively testedfor antigen binding in a solid-phase phage ELISA.

Individual Nbs were cloned with a C-terminal 6×His-tag, expressed inEscherichia coli (E. coli) and purified using immobilized metal ionaffinity chromatography ((MAC) followed by size exclusionchromatography. It was tested whether the β-catenin-specific Nbs arefunctional as capture molecules in a microsphere-based sandwichimmunoassay system. To this end, the Nbs were immobilized on magneticmicrospheres (MagPlex) and were incubated with decreasing amounts ofβ-catenin Bound β-catenin was detected with a C-terminus-specificantibody (clone 14/BD).

With BC2-Nb one of the best-performing nanobody was chosen. Incombination with the C-terminal β-catenin-specific antibody as detector,bound β-catenin was detectable down to ˜1 ng/ml when using BC2 ascapture molecule (see FIG. 14).

For further characterization of the selected BC2-Nb, its affinity torecombinant β-catenin was determined using surface plasmon resonance(SPR). After immobilizing β-catenin, the association/dissociation rateswere measured by injecting serial dilutions of five differentconcentrations for BC2-Nb. By this, we determined affinities (K_(D)values) in the low nanomolar range of ˜3.1 nM, which is in accordancewith the strong binding signal observed in the microsphere-basedsandwich immunoassay experiments (See FIG. 15).

Domain and Epitope Mapping of β-Catenin Binders

Structural analysis of β-catenin revealed a tripartite structure: anegatively charged N-terminus (aa 1-aa 140), the core domain (aa 141-aa664) composed of 12 Armadillo repeats and a short C-terminus (aa 665-aa781). The amino acid sequence of human β-catenin is available in GenBankunder Accession Number NP_001091679.1, Version Number GI:148233338. Totest whether the selected BC2-Nb recognize these individual domains ofβ-catenin, a microsphere-based sandwich immunoassay was performedcapturing either full-length β-catenin or the indicated domains with theselected Nbs. With this approach, it could be shown that BC2 exclusivelyrecognizes the N-terminal domain (aa 1-119) of β-catenin (see FIG. 16).

To determine the minimal linear epitope of BC2-Nb a pepscan analysis wasperformed. For this purpose, 29 synthetic 15-mer peptides with a 11amino acids overlap between consecutive peptides representing aa 1-127of the β-catenin N-terminus were used. All peptides were immobilized viaan additional cysteine at the N-terminus of the peptide on individualmicrosphere panicles and incubated with biotinylated BC2-Nb inincreasing concentrations (for reference see Bauer et al. (2012)Identification and quantification of a new family of peptideendocannabinoids (Pepcans) showing negative allosteric modulation at CB1receptors. The Journal of biological chemistry 287, 36944-36967).

For BC2-Nb, the analysis showed strong binding to two consecutivepeptides comprising the residues 13-31 (FIG. 17 A). Further truncationrevealed that BC2-Nb recognizes the amino acid sequence PDRKAAVSHWQQ(SEQ ID NO: 4) (aa 16-27) (FIG. 17 B). This observation is quite notablesince only very few nanobodies are known to bind short linear peptidesequences (for reference see Muyldermans, S. (2013) Nanobodies: naturalsingle-domain antibodies. Annual review of biochemistry 82, 775-797).For further analyses of the BC2-Nb epitope specificity, binding studieswere performed using the identified peptide with a phosphorylated Ser23residue. This modification completely abolishes binding of BC2suggesting that this nanobody recognizes β-catenin only when Ser23 isnot phosphorylated (FIG. 17 C). In summary, the data shows that BC2-Nbbinds a linear peptide at the very N-terminus of β-catenin in aphosphorylation-dependent manner.

REFERENCES

1 De Genst, E. et al. Molecular basis for the preferential cleftrecognition by dromedary heavy-chain antibodies. Proceedings of theNational Academy of Sciences of the United States of America 103,4586-4591, doi. 10.1073/pnas.0505379103 (2006).

2 Muyldermans, S. Single domain camel antibodies, current status.Journal of biotechnology 74, 277-302 (2001).

3 Govaert, J. et al. Dual beneficial effect of interloop disulfide bondfor single domain antibody fragments. The Journal of biologicalchemistry 287, 1970-1979. doi:10.1074/jbc.M111.242818 (2012).

4 Rothbauer, U. et al. A versatile nanotrap for biochemical andfunctional studies with fluorescent fusion proteins. Molecular &cellular proteomics: MCP 7, 282-289, doi:10.1074/mcp. M700342-MCP200(2008).

5 Muyldermans, S. Nanobodies, natural single-domain antibodies Annu RevBiochem 82, 775-797, doi:10.1146/annurev-biochem-063011-092449(2013).

6 Lee, S. Y. et al. Ube3a, the E3 ubiquitin ligase causing Angelmansyndrome and linked to autism, regulates protein homeostasis through theproteasomal shuttle Rpn10. Cellular and molecular life sciences: CMLS71, 2747-2758, doi:10.1007/s00018-013-1526-7(2014).

7 Schembri, L. et al. The HA tag is cleaved and loses immunoreactivityduring apoptosis. Nature methods 4, 107-108, doi:10.1038/nmeth0207-107(2007).

8 Wegner, G. J., Lee. H. J. & Com. R. M. Characterization andoptimization of peptide arrays for the study of epitope-antibodyinteractions using surface plasmon resonance imaging. Analyticalchemistry 74, 5161-5168 (2002).

9 Hilpert, K. et al. Anti-c-myc antibody 9E10: epitope key positions andvariability-characterized using peptide spot synthesis on cellulose.Protein Eng 14, 803-806 (2001).

10 De Genst. E. J. et al. Structure and properties of a complex ofalpha-synuclein and a single-domain camelid antibody, Journal ofmolecular biology 402, 326-343. doi:10.1016j.jmb.2010.07.001 (2010).

11 Traenkle, B. et al. Monitoring interactions and dynamics ofendogenous beta-catenin with intracellular nanobodies in living cells.Molecular & cellular proteomics: MCP, doi:10.1074/mcp.M114.044016(2015).

12 Huang, L. et al. SPECT imaging with 99mTc-labeled EGFR-specificnanobody for in vivo monitoring of EGFR expression. Molecular imagingand biology: MIB: the official publication of the Academy of MolecularImaging 10, 167-175, doi:10.1007/s11307-008-0133-8 (2008).

13 Vaneycken, I. et al. Preclinical screening of anti-HER2 nanobodiesfor molecular imaging of breast cancer. FASEB journal: officialpublication of the Federation of American Societies for ExperimentalBiology 25, 2433-2446, doi:10.1096/fj.10-80331 (2011).

14 Broisat, A et al. Nanobodies targeting mouse human VCAM1 for thenuclear imaging of atherosclerotic lesions. Circulation research 110,927-937, doi:10.1161/CIRCRESAHA.112.265140 (2012).

15 Ries, J., Kaplan. C., Platonova, E., Eghlidi, H. & Ewers. H. Asimple, versatile method for GFP-based super-resolution microscopy viananobodies. Nature methods 9, 582-584, doi:10.1038/nmeth.1991 (2012).

16 Yoon, M., Moir, R. D., Prahlad, V. & Goldman, R. D. Motile propertiesof vimentin intermediate filament networks in living cells. The Journalof cell biology 143, 147-157 (1998).

17 Leonhardt, H. et al. Dynamics of DNA replication factories in livingcells. The Journal of cell biology 149, 271-280 (2000).

18 Maier, J., Traenkle, B. & Rothbauer, U. Real-time analysis ofepithelial-mesenchymal transition using fluorescent single-domainantibodies. Scientific reports 5, 13402. doi:10.1038/srep13402 (2015).

19 Kirchhofer, A. et al. Modulation of protein properties in livingcells using nanobodies. Nature structural & molecular biology 17,133-138. doi:10.1038/nsmb.1727 (2010).

20 Kabsch. W. Xds. Acta crystallographies. Section D. Biologicalcrystallography 66, 125-132, doi:10.1107/80907444909047337 (2010).

21 McCoy, A. J. et al. Phaser cryslallographic software. Journal ofapplied crystallography 40, 658-674, doi:10.1107/S0021889807021206(2007).

22 Winn, M. D. et al. Overview of the CCP4 suite and currentdevelopments. Acta crystallographies. Section D, Biologicalcrystallography 67, 235-242. doi:10.1107/S0907444910045749 (2011).

23 Stein. N. CHAINSAW: a program for mutating, pdb files used astemplates in molecular replacement. Journal of applied crystallography41, 641-643, doi:Doi 10.1107/S0021889808006985 (2008).

24 Adams, P. D. et al. PHENIX a comprehensive Python-based system formacromolecular structure solution. Acta crystallographies. Section D,Biological crystallography 66, 213-221, doi:10.1107.S0907444909052925(2010).

25 Murshudov, G. N., Vagin, A. A. & Dodson, E. J. Refinement ofmacromolecular structures by the maximum-likelihood method Actacrystallographies. Section D, Biological crystallography 53, 240-255,doi:10.1107/S0907444996012255 (1997).

26 Emsley, P. & Cowtan, K, Coot: model-building tools for moleculargraphics. Acta crystallographica. Section D. Biological crystallography60, 2126-2132. doi:10.1107/S0907444904019158 (2004).

27 Chen, V. B. et al. MolProbity: all-atom structure validation formacromolecular crystallography. Acta crystallographica. Section D,Biological crystallography 66, 12-21, doi. 10.1107/S0907444909042073(2010).

What is claimed is:
 1. A kit for purification of a polypeptidecomprising an epitope peptide, wherein the kit comprises: a) an antibodythat specifically binds to the epitope peptide, wherein the epitopepeptide comprises from about 8 to about 25 amino acids, wherein theamino acid sequence comprises a sequence as defined in SEQ ID NO: 1(RX₄X₅AX₇SX₉W), wherein X₄ can be K or a substitution; wherein X₅ can beA or R or a conservative substitution of A or R; wherein X₇ can be V ora conservative substitution of V, wherein X₉ can be H or a conservativesubstitution of H, wherein the antibody comprises an amino acid sequencehaving at least 90% identity to SEQ ID NO: 6; and b) a secondary bindingpartner, wherein the secondary binding partner is specific for theantibody, specific for the polypeptide, or specific for a complexcomprising both, and wherein the secondary binding partner comprises adetectable moiety and/or an immobilized or immobilizable moiety.
 2. Thekit of claim 1, wherein the epitope peptide comprises from about 12 toabout 25 amino acids, wherein the amino acid sequence comprises asequence as defined by SEQ ID NO: 32 (X₁X₂RX₄X₅AX₇SX₉WX₁₁X₁₂), whereinX₁ can be P or A; wherein X₂ can be D or a conservative substitution ofD; wherein X₄ can be K or a substitution; wherein X₅ can be A or R or aconservative substitution of A or R; wherein X₇ can be V or aconservative substitution of V, and wherein X₉ can be H or aconservative substitution of H; and wherein X₁₁ and X₁₂ can be Q or aconservative substitution of Q.
 3. The kit of claim 1, wherein theepitope peptide has an amino acid sequence as defined by SEQ ID NO: 3(RKAAVSHW); or an amino acid sequence as defined by SEQ ID NO: 4(PDRKAAVSHWQQ); or an amino acid sequence as defined by SEQ ID NO: 5(PVRSAALSQWSS), or an amino acid sequence as defined by SEQ ID NO:33(PDRVRAVSHWSS), or an amino acid sequence as defined by SEQ ID NO: 34(ADRVRAVSHWSS).
 4. The kit of claim 1, wherein the antibody is ananobody, wherein the nanobody comprises an amino acid sequencecomprising framework region 1, CDR1, framework region 2, CDR2, frameworkregion 3, CDR3, and framework region 4, wherein CDR3 comprises SEQ IDNO:12, and wherein a cysteine is present in the framework region 2,which is capable of forming a disulfide bridge with the cysteine presentin CDR3.
 5. The kit of claim 1, wherein the antibody is an antibodyhaving the amino acid sequence as defined in SEQ ID NO:
 6. 6. The kit ofclaim 1, wherein the antibody is attached to a solid support.
 7. The kitof claim 6, wherein the solid support is selected from the groupcomprising polystyrene, polypropylene, polyvinylchloride,polyacrylamide, celluloses, dextrans, synthetic polymers andco-polymers, latex, silica, agarose, metal, glass, and carbon.
 8. Amethod of purifying a polypeptide comprising the steps of: a) contactinga sample comprising a polypeptide that comprises an epitope peptide withthe antibody of the kit of claim 1, wherein prior to or after binding,the antibody is attached to a solid support; b) washing the solidsupport of step a) to remove unbound and unspecifically boundconstituents; and c) eluting the polypeptide.
 9. A method of purifying apolypeptide comprising the steps of: a) contacting a sample comprising apolypeptide that comprises an epitope peptide with the antibody of thekit of claim 1; b) contacting the sample obtained after step a) with asecondary binding partner specific for the antibody, specific for thepolypeptide, or specific for a complex comprising both, wherein thesecondary binding partner is attached to a solid support prior to orafter binding; c) washing the solid support of step b) to remove unboundand unspecifically bound constituents; and d) eluting the polypeptide.10. The method of claim 8, wherein the antibody is an antibody specificfor the epitope peptide, wherein the antibody is a nanobody, wherein thenanobody comprises an amino acid sequence comprising framework region 1,CDR1, framework region 2, CDR2, framework region 3, CDR3, and frameworkregion 4, wherein CDR3 comprises SEQ ID NO: 12, and wherein a cysteineis present in the framework region 2, which is capable of forming adisulfide bridge with the cysteine present in CDR3.
 11. The method ofclaim 9, wherein the antibody is an antibody specific for the epitopepeptide, wherein the antibody is a nanobody, wherein the nanobodycomprises an amino acid sequence comprising framework region 1, CDR1,framework region 2, CDR2, framework region 3, CDR3, and framework region4, wherein CDR3 comprises SEQ ID NO: 12, and wherein a cysteine ispresent in the framework region 2, which is capable of forming adisulfide bridge with the cysteine present in CDR3.
 12. The method ofclaim 8, wherein the antibody comprises SEQ ID NO:
 6. 13. The method ofclaim 9, wherein the antibody comprises SEQ ID NO: 6.